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SEQ ID 1 D58-BG7 

1 GCACAACTT GCTATCAACT TGGTCACATC TATGTTGGGT 
61 CATTTGTTGC ATCATTTTAC ATGGGCTCCG GCCCCGGGGG TTAACCCGGA GGATATTGAC 
121 TTGGAGGAGA GCCCTGGAAC AGTAACTTAC ATGAAAAATC CAATACAAGC TATTCCAACT 
181 CCAAGATTGC CTGCACACTT GTATGGACGT GTGCCAGTGG ATATGTAA 
SEQ ID 2 

AQLAINLVTSMLGHLLHHFTWAPAPGVNPEDIDLEESPGTVTYMKNPIQAI PTPRLPAHLYGRVPVDM 
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SEQ ID 3 D58-AB1 

1 GCACAACT TGCTATCAAC TTGGTCACAT CTATGTTGGG 
61 TCATTTGTTG CATCATTTTA CGTGGGCTCC GCCCCGGGGG GTTAACCCGG AGAATATTGA 
121 CTTGGAGGAG AGCCCTGGAA CAGTAACTTA CATGAAAAAT CCAATACAAG CTATTCCTAC 
181 TCCAAGATTG CCTGCACACT TGTATGGACG TGTGCCAGTG GATATGTAA 
SEQ ID 4 

AQLAINLWSMLGHLLHHFTWAPPPGWPENIDLEESPGTVTYMKNPIQAI PTPRLPAHLYGRVPVDM 
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SEQ ID 5 D186-AH4 

1 ATGAATTAT TCATTGCAAG TGGAACACCT TTCAATTGCT 
61 CATATGATCC AAGGTTTCAG TTTTGCAACT ACGACCAATG AGCCTTTGGA TATGAAACAA 
121 GGTGTGGGTT TAACTTTACC AAAGAAGACT GATGTTGAAG TGCTAATTAC ACCTCGCCTT 
181 CCTCCTACGC TTTATCAATA TTAA 
SEQ ID 6 

MNYSLQVEHLSIAHMIQGFSFATTTNEPLDMKQGVGLTLPKKTDVEVLITPRLPPTLYQY 
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SEQ ID 7 D58-BE4 

1 GCACAACTT GCTATCAACT TGGTCACATC TATGTTGGGT 
61 CATTTGTTCA TCATTTTACA TGGGCTCCGG CCCCGGGGGT TAACCCGGAG GATATTGACT 
121 TGGAGGAGAG CCCTGGAACA GTAACTTACA TGA 
SEQ ID 8 

AQLAINLVTSMLGHLFIILHGLRPRGLTRRILTWRRALEQ 
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SEQ ID 9 D56-AH7 

1 GAAGGATTG GCTGTTCGAA TGGTTGCCTT GTCATTGGGA 
61 TGTATTATTC AATGTTTTGA TTGGCAACGA ATCGGCGAAG AATTGGTTGA TATGACTGAA 
121 GGAACTGGAC TTACTTTGCC TAAAGCTCAA CCTTTGGTGG CCAAGTGTAG CCCACGACCT 
181 AAAATGGCTA ATCTTCTCTC TCAGATTTGA 



SEQ ID 10 

EGIA\^WALSLGCIIQCFDWQRIGEELVDMTEGTGLTLPKAQPLVAKCSPRPKMANLLSQI 
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SEQ ID 11 D13a-5 

1 GAAGGATTG GCTATTCGAA TGGTTGCATT GTCATTGGGA 

61 TGTATTATTC AATGCTTTGA TTGGCAACGA CTTGGGGAAG GATTGGTTGA TAAGACTGAA 

121 GGAACTGGAC TTACTTTGCC TAAAGCTCAA CCTTTAGTGG CCAAGTGTAG CCCACGACCT 

181 ATAATGGCTA ATCTTCTTTC TCAGATTTGA 
SEQ ID 12 

EGLAIRMVALSLGCIIQCFDWQRLGEGLVDKTEGTGLTLPKAQPLVAKCSPRPIMANLLSQI 
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SEQ ID 13 D56-AG10 

1 ATAGGTTTT GCGACTTTAG TGACACATCT GACTTTTGGT 

61 CGCTTGCTTC AAGGTTTTGA TTTTAGTAAG CCATCAAACA CGCCAATTGA CATGACAGAA 

121 GGCGTAGGCG TTACTTTGCC TAAGGTTAAT CAAGTTGAAG TTCTAATTAC CCCTCGTTTA 

181 CCTTCTAAGC TTTATTTATT TTGA 
SEQ ID 14 

IGFATLVTHLTFGRLLQGFDFSKPSNTPIDMTEGVGVTLPKVNQVEVLITPRLPSKLYLF 
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SEQ ID 15 D35-33 

1 ATAGGCTTT GCGACTTTAG TGACACATCT GACTTTTGGT 

61 CGCTTGCTTC AAGGTTTTGA TTTTAGTAAG CCATCAAACA CGCCAATTGA CATGACAGAA 

121 GGCGTAGGCG TTACTTTGCC TAAGGTTAAT CAAGTTGAAG TTCTAATTAC CCCTCGTTTA 

181 CCTTCTAAGC TTTATTTAT 
SEQ ID 16 

IGFATLVTHLTFGRLLQGFDFSKPSNTPIDMTEGVGVTLPKVNQVEVLITPRLPSKLYL 
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SEQ ID 17 D34-62 

1 ATAAATTTT GCGACTTTAG TGACACATCT GACTTTTGGT 

61 CGCTTGCTTC AAGGTTTTGA TTTTAGTACG CCATCAAACA CGCCAATAGA CATGACAGAA 

121 GGCGTAGGCG TTACTTTGCC TAAGGTAAAT CAAGTGGAAG TTCTAATTAG CCCTCGTTTA 

181 CCTTCTAAGC TTTATGTATT CTGA 
SEQ ID 18 

INFATLVTHLTFGRLLQGFDFSTPSNTPIDMTEGVGVTLPKVNQVEVLISPRLPSKLYVF 
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SEQ ID 19 D56AA7 

1 ATTATACTT GCATTGCCAA TTCTTGGCAT CACTTTGGGA 
61 CGTTTGGTTC AGAACTTTGA GCTGTTGCCT CCTCCAGGCC AGTCGAAGCT CGACACCACA 
121 GAGAAAGGTG GACAGTT CAG TCTCCACATT TTGAAGCATT CCACCATTGT GTTGAAACCA 
181 AGGTCTTTCT GA 
SEQ ID 20 

IILALPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVLKPRSF 
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SEQ ID 21 D56-AE1 

1 ATTATACTT GCATTGCCAA TTCTTGGCAT TACTTTGGGA 
61 CGTTTGGTTC AGAACTTTGA GCTGTTGCCT CCTCCAGGCC AGTCGAAGCT CGACACCACA 
121 GAGAAAGGTG GACAGTTCAG TCTCCATATT TTGAAGCATT CCACCATTGT GTTGAAACCA 
181 AGGTCTTGCT GA 
SEQ ID 22 

IILALPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVLKPRSC 
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SEQ ID 23 D35-BB7 

1 TATTGCACTT GGGGTTGCAT CAAT GGAACT TGCATTGTCA 
61 AATCTTCTTT ATGCATTTGA TTGGGAGTTA CCTTTTGGAA TGAAAAAAGA AGACATT GAC 
121 ACAAACGCCA GGCCTGGAAT TACCATGCAT AAGAAAAACG AACTTTATCT TATCCCTAAA 
181 AATTATCTAT AG 

IA^GVASMELALSNLLYAFDWELPFGMK^ 



FIG. 13 

SEQ ID 25 D177-BA7 

1 ATTGCACTTG GGGTTGCATC CATGGAACTT 
121 GCTTTGTCAA ATCTTCTTTA TGCATTTGAT TGGGAGTTAC CTTACGGAGT GAAAAAAGAA 
181 AACATTGACA CAAATGTCAG GCCTGGAATT AC CAT G CAT A AGAAAAACGA ACTTTGCCTT 
241 ATCCCTAGAA ATTATCTATA G 
SEQ ID 26 

IALGVASMELALSNLLYAFDWELPYGVKKENIDTNVRPGITMHKKNELCLIPRNYL 
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SEO ID 27 D56A-AB6 

1 GGTATTGCAC TTGGGGTTGC ATCCATGGAA CTTGCTTTGT CAAATCTTCT TTATGCATTT 
61 GATTGGGAGT TGCCTTATGG AGTGAAAAAA GAAGACATCG ACACAAACGT TAGGCCTGGA 
121 ATTGCCATGC ACAAGAAAAA CGAACTTTGC CTTGTCCCAA AAAATTATTT ATAA 
SEQ ID 28 



SEQ ID 28 

IALGVASMELALSNLLYAFDWELPYGVKKEDIDTNVRPGIAMHKKNELCLVPKNYL 
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SEQ ID 29 D144-AE2 

1 ATT GCACTTGGGG TTGCATCCAT GGAACT TGCT *,-»r 
61 TTGTCAAATC TTCTTTATGC ATTTGATTGG GAGTTGCCTT ATGGAGTGAA AAAAGAAGAC 
121 AT CGACACAA ACGTTAGGCC TGGAATTGCC ATGCACAAGA AAAAC GAACT TTGCCTTGTC 
181 CCAAAAAAAT TATTTATAAA TTATATTGGG ACGTGGATCT CATGCTAG 



SEQ ID 30 

IALGVJ 



D 30 

/ASMELALSNLLYAFDWELPYGVKKEDIDTNVRPGIAMHKKNELCLVPKKLFINYIGTWISC 
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<5i?o ID 31 D56-AG11 

1 attTCGTTT GGTTTAGCTA ATGCTTATTT GCCATTGGCT mrrJk __ rrACTTGGAC 

i Bsss 5SSS SSSS SEES SS 35SSS 

181 CCTTATCAAC CTCCTCAAAA CTGA 
ISFGIANAYLPIAQLLYHFDWELPTGIKPSDIjDLTELVGWAARKSDLYLVATPYQPPQN 



fig. n 



SEQ ID 33 ^"^TAGCTA ATGCTTATTT GCCATTGGCT rGACTTGGAC 

i SES5 ssss ssss sees sss ss 

181 CCTTATCAAC CTCCTCAAAA GTGA 
I S FG1ANAYLPLAQLLYHFDWKLPAGI EP S DLDLTELVGVTAARKS DLYLVAT PYQP PQK 
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361 GGA 
SEQ ID 36 



MLFG1ANVGQPIAQLLYHFDWKLPNGQSHENFDMTESPGISATRKDDLVLIATPYDSY 
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SEQ ID 37 D144-AD1 



i = = =ss 

181 ATTTTGATTG CCACTCCTGC TCATTCTTGA 
MLFG1ANVGQPLAQLLYHFDWKLPNGQTHQNFDMTESPGISATRKDDLILIATPAHS 
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SEQ ID 39 D144-AB5 



i SSSS SEE =S 

181 TGTTTAATTG CTACTCCTTT TGGTCTCAAT TAA 

l!fgl^ghpiaqllyhfdwktlpgissdsfdmtetdgvtagrkddlcliatpfgln 
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SEQ ID 41 D181-AB5 

1 A TGTCGTTTGG TTTAGTTAAC ACTGGGCATC CTTTAGCTCA 
61 GTTGCTCTAT TTCTTTGACT GGAAATTCCC TCATAAGGTT AATGCAGCTG ATTTTCACAC 
121 TACTGAAACA AGTAGAGTTT TTGCAGCAAG CAAAGATGAC CTCTACTTGA TTCCAACAAA 

181 tcacatggag caagagtag 
ms^wtLpiaqllyffdwkfphkvnaadfhttetsrvfaaskddlyliptnhmeq 
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SEQ ID 43 D73-AC9 

1 AT GTCGTTTGGT TTAGTTAACA CAGGGCATCC TTTAGCCCAG 
121 TTGCTCTATT GCTTTGACTG GAAACTCCCT GACAAGGTTA ATGCAAATGA TTTTCGCACT 
181 ACTGAAACAA GTAGAGTTTT TGCAGCAAGC AAAGATGACC TCTACTTGAT TCCCACAAAT 
241 CACAGGGAGC AAGAATAG 
SEQ ID 4 4 

MSFGLVNTGHPLAQLLYCFDWKLPDKVNANDFRTTETSRVFAASKDDLYLIPTNHREQE 
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SEQ ID 45 D56-AC12 

1 ATGCAATTT GGTTTGGCTC TTGTTACTCT GCCATTGGCT 
61 CATTTGCTTC ACAATTTTGA TTGGAAACTT CCCGAAGGAA TTAATGCAAG GGATTTGGAC 
121 ATGACAGAGG CAAATGGGAT ATCTGCTAGA AGAGAAAAAG ATCTTTACTT GATTGCTACT 
181 CCTTATGTAT CACCTCTTGA TTAA 

MQFG^LWLPLAHLLHNFDWKLPEGINARDLDMTEANGISARREKDLYLIATPYVSPLD 
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SEQ ID 47 D58-AB9 

1 ATGACTTAT GCATTGCAAG TGGAACACCT AACAATGGCA 
61 CATTTGATCC AGGGTTTCAA TTACAGAACT CCAACTGATG AGCCCTTGGA TATGAAAGAA 
121 GGTGCAGGCA TAACTATACG TAAGGTAAAT CCTGTGAAAG TGATAATTAC GCCTCGCTTG 
181 GCACCTGAGC TTTATTAA 

^yalqvehltmahl^^ 
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SEQ ID 49 D56-AG9 

1 ATGACTTAT GCATTGCAAG TGGAACACCT AACAATGGCA 
61 CATTTAATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGGCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TAACTATACG TAAGGTAAAT CCTGTGGAAC TGATAATAGC GCCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 50 

mtyalqvehltmahliqgfnyktpndealdmkegagitirkvnpveliiaprlapely 
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SEQ ID 51 D56-AG6 

1 ATGACTTAT GCATTGCAAG TGGAACACCT AACAATGGCA 
61 CATTTAATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGGCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TaIcAATACG TAAGGTAAAT CCAGTGGAAT TGATAATAAC GCCTCGCTTG 

181 gcacctgagc tttactaa 
otya^veLtmahliqgfnyktpn^ 
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SEQ ID 53 D35-BG11 

1 ATGACTTAT GCATTGCAAG TGGAACACTT AACAATGGCA 
61 CATTTGATCC AAGGTTTCAA TTACAGAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TAACTATACG TAAGGTAAAT CCTGTGGAAC T GATAATAGC GCCTCGCCTG 
181 GCACCTGAGC TTTATTAA 

^YALQVEHLTMAHLIQGFNYRTPNDEPLDMKEGAGITI RKVNPVELI IAPRLAPELY 
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SEQ ID 55 D35-42 

1 ATGACTTAT GCATTGCAAG TGGAACACTT AACAATGGCA 
61 CATTTGATCC AAGGTTTCAA TTACAGAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TAACTATACG TAAGGTAAAT CCTGTGGAAC T GATAATAGC GCCCCTGGCA 
181 CCTGAGCTTT ATTAA 
SEQ ID 56 

MTYALQVEHLTMAHLIQGFNYRTPNDEPLDMKEGAGITIRKVNPVELIIAPLAPELY 
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SEQ ID 57 D35-BA3 

1 ATGACTTAT GCATTGCAAG TGGAACACTT AACAATGGCA 
61 CATTTGATCC AAGGTTTCAA TTACAGAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TAACTATACG TAAGGTAAAT CCTGCGGAAC T GATAATAGC GCCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 58 

MTYALQVEHLTMAHLIQGFNYRTPNDEPLDMKEGAGITIRKVNPAELIIAPRLAPELY 
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SEQ ID 59 D34-57 

1 ATGACTTAT GCATTACAAG TGGAACACCT AACAATAGCA 
61 CATTTGATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGTGCAGGAT TAACCATACG TAAAGTAAAT CCTGTAGAAG TGACAACTAC GGCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 60 

MT YALQ VEH LT I AHL I Q G FN YKT PN D EP LDMKE GAGLT I RKVN P VEVT TT ARLAP E L Y 
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SEQ ID 61 D34-52 

A ^^^^^S^^ AGCCCTTGGA TATGAAGGAA 
m GGTGCAGGAT W TAAAGTAAAT CCTGTAGAAG TGACAATTAC GGCTCGCCTG 

181 gcacctgagc tttattaa 
ot?mqwLtiahliqgfnyktpndepldmkegagltirkvnpvevtitarlapely 
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SEQ ID 63 °'*_?? TTACAAG XGGAACACCT AACAATAGCA «.-_«» 

61 £atStCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGCCCCTGGA TATGAAGGAA 

121 GGTGCMGAT TAACTATACG TAAAGTAAAT CCTGTAGAAG TGACAATTAC GGCTCGCCTG 

181 GCACCTGAGC TTTATTAA 
1^ YALQVEHLT I AHLI QGFN YKT PNDEP LDMKEGAGLT I RKVN PVEVT I TARLAPEL Y 



FIG. 33 



SEO ID 65 D56AD10 

1 TATAGCCTT GGACTTAAGG TTATCCGAGT AACATTAGCC ^^^^^ 
61 AACATGTTGC ATGGATTCAA CTGGAAATTA CCTGAAGGTA T GAAGC CAGA AGATATAAGT 
121 ^£Sc ATTATGGGCT CACTACACAT CCTAAGTTTC CTGTTCCTGT GATCTTGGAA 
181 TCTAGACTTT CTTCAGATCT CTATTCCCCC ATCACTTAA 

ysLl^rvti^lhg^ 
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SEQ ID 67 D56-AA11 

6 i SSSSS SSES SS5SS SSS tgagtccaga aga = c 
121 !S tttatgggct aattacacac cccaaagtcg cacttgacgt gatgatggag 
181 cctcgacttc ccaaccatct ttacaaatag 

ysLiriiratuw^ 
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SEQ ID 69 D177-BD5 

i ATTAATTTTT CAATACCACT TGTTGAGCTT 

121 gScttgcta atctattgtt tcattataat tggtcacttc ctgaagggat gctagctaag 
m gatgttgata tgSagaagc tttggggatt accatgcaca agaaatctcc cctttgctta 

241 GTAGCTTCTC ATTATACTTG TTGA 
INFSIPL^LALANLLFHYNWSLPEGMLAKDVDM 



FI . 36 



SEQ ID 7 \ ATGCAACTTG A GGCT?TATGC ATTGGAAATG GCTGTGGCCC ATCTTCTTCA TTGTTTTACT 
6 CAGATGGTAT GAAACCAAGT GAGCTTXAAA TGGATGATAT ™GACTC 

121 ACTGCTCCAA AAGCTAATCG ACTCGTGGCT GTGCCTACTC CACGTTTGTT GTGTCCCCTT 
181 TATTAATTGA 

MQLGLYALEMAVAHLLHCFTWELPDGMKPSELKMDDIFGLTAPKANRLVAVPTPRLLCPLY 
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SEO ID 73 58-BC5 

1 ATGCAACTT GGGCTTTATG CATTAGAAAT GGCAGTGGCC 
fil CATCTTCTTC TTTGCTTTAC TTGGGAATTG CCAGATGGTA TGAAACCAAG TGAGCTTAAA 
121 «S tttttgSct CACTGCTCCA AGAGCTAATC GACTCGTGGC TGTGCCTAGT 
181 CCACGTTTGT TGTGCCCACT TTATTAA 

MQLGL YALEMAVAHLLLC FTWEL P D GMKP S E LKMD D I FGLT AP RAN RLVAVP S P RLLCP L Y 
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SEQ ID 75 D58-AD12 

1 ATGCAACTT GGGCTTTATG CATTGGAAAT GGCTGTGGCC 
fil CAT CTT CTT C ATT GTTT T AC TTGGGAATTG CCAGATGGTA TGAAACCAAG TGAGCTTAAA 
121 aSS StTGgIcT CACTGCTCCA AGAGCTAATC GACTCGTGGC TGTGCCTACT 
181 CCACGTTTGT TGTGTCCCCT TTATTAA 

MQLGLYALEMAVAHLLHCFTWELP 
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SEQ ID 77 D56-AC11 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACTTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT TCAGAGTAGC ATGA 
SEQ ID 78 

MLWSAS I VRVS YLTCI YRFQVYAGSVFRVA 
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SEQ ID 79 D35-39 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACTTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT TCAGAGTAGC ATGA 

SEQ ID 80 

MLWSAS I VRVS YLTC I YRFQVYAG SVFRVA 
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SEQ ID 81 D58-BH4 

1 ATGCTTTGG AGT GCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACCTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT TCAGAGTAGC ATGA 
SEQ ID 82 

MLW S AS I VRVS YLT C I Y RFQVYAG S VFRVA 
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<5EO ID 83 D177-BD7 

1 ATTAATTTTT CAATACCACT TGTTGAGCTT GCACTTGCTA ATCTATTGTT TCATTATAAT 
61 TGGTCACTTC CTGAGGGGAT GCTACCTAAG GATGTTGATA TGGAAGAAGC TTTGGGGATT 
121 ACCATGCACA AGAAATCTCC CCTTTGCTTA GTAGCTTCTC ATTATAACTT GTTGTGA 

INFSIPL^IAIANLLFHYNWSLPEGMLPKDVDMEEALGI 
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SEQ ID 85 

D176-BF2 

1 AT ATCATTTGGT TTGGCTAATG TTTATTTGCC ACTAGCTCAA 
121 TTGTTATATC ATTTTGATTG GAAACTCCCT ACTGGAATCA ATTCAAGTGA CTTGGACATG 
181 ACTGAGTCGT CAGGAGTAAC TTGTGCTAGA AAGAGTGATT TATACTTGAC TGCTACTCCA 
241 TATCAACTTT CTCAAGAGTG A 
SEQ ID 86 

GISFGLANVYLPLAQLLYHFDWKLPTGINSSDLDMTESSGVTCARKSDLYLTATPYQLSQE 
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SEQ ID 87 D56-AD6 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACTTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT CCAGAGTAGC ATGA 
SEQ ID 88 

MLWSAS I VRVS YLTCI YRFQVYAGSVS RVA 
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SEQ ID 89 D73A-AD6 

1 CT GAATTTTGCA ATGTTAGAGG CAAAAATGGC ACTTGCATTG 
121 ATTCTACAAC ACTATGCTTT TGAGCTCTCT CCATCTTATG CACATGCTCC T CAT ACAAT T 
181 ATCACTCTGC AACCTCAACA TGGTGCTCCT TTGATTTTGC GCAAGCTGTA G 
SEQ ID 90 

LNFAMLEAKMALALILQHYAFELSPSYAHAPHTIITLQPQHGAPLILRKL 
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SEO ID 91 D70A-BA11 

1 CT GAATTTTGCA ATGTTAGAGG CAAAAATGGC ^TTGCMTG TCATACAATT 
121 ATTCTACAAC ACTATGCTTT TGAGCTCTCT CCATCTTATG CACACGCTCC TCATACAAT 1 
ill SSS? AACCTCAACA TGGTGCTCCT TTGATTTTGC GCAAGCTGTA G 



SEQ ID 92 

LNF 



) ID 92 

Jamleakmalalilqhyafelspsyahaphtiitlqpqhgaplilrkl 
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SEO ID 93 D70A-BB5 

1 AA TAATTTTGCA ATGTTGGAAA CTAAGATTGC CTTAGCAATG ^^^^ 
m\ MCC^ACAGC gttttgcttt CGAGCTTTCT ccatcttacg CTCATGCACC TACTTATGTC 
ill Scctcagtg tggtgctcac ttaatcttgc aaaaattata ggtccttaat 

241 CTGGATTTCC cattattgag tagtgcctaa taaatcttct ctatcactat ttttccatct 

301 TTCA 
SEQ ID 94 

NNFAMLETKIAIAMILQRFAFELSPSYAHAPTYVVTLRPQCGAHLIIjQKL 
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SEQ ID 95 ^ll^l^ GGC aacAAAGGGG AAAATGACAT ATTTTCCATT TGGTGCAGGA 

61 CCGCGAAAAT OCATTGGOCA AAACTTCGCG ATTTTGGAAG CAAAAATGGC TATAGCTATG 
121 ATTCTACAAC GCTTCTCCTT CGAGCTCTCC CCATCTTATA CACACTCTCC ATACACTGTG 
\ll GTCACTTTGA AACCCAAATA TGGTGCTCCC CTAATAATGC ACAGGCTGTA GTCCTGTGAG 
241 AATATGCTAT CCGAGGAATT CAGTTCCT 

SEQ ID 96 

QNFAILEAKMAIAMILQRFSFELSPSYTHSPYTVVTLKPKYGAPLIMHRL 
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SEQ ID 97 ^f^^^ aacAAAGGGG AAAATGACAT ATTTTCCATT TGGTGCAGGA 

61 CCGCGAAAAT GCATTGGGCA AAACTTCGCG ATTTTGGAAG CAAAAATGGC TATAGCTATG 
121 ATTCTACAAC GCTTCTCCTT CGAGCTCTCT CCATCTTATA CACACTCTCC ATACACTGTG 
181 GTCACTTTGA AACCCAAATA TGGTGCTCCC CTAATAATGC ACAGGCTGTA GTCCTGT 

SEQ ID 98 

QN FAI L EAKMAI AMI LQRFSFELSPSYTHSP YT WT LKP KYGAP L IMH RL 
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1 C AAAATTTTGC CATGTTAGAA GCAAAGATGG CTCTGTCTAT GATCCTGCAA 
121 CG^TCTCTT TTGAACTGTC TCCGTCTTAT GCACATGCCC CTCAGTCCAT ATTAACCGT 
181 CAGCCACAAT ATGGTGCTCC ACTTATTTTC CACAAGCTAT AA 
SEQ ID 100 

QNFAMLEAKMALSMILQRFSFELSPSYAHAPQSILTVQPQYGAPLIFHKL 
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SEQ ID 101 D70A-BH2 



181 ATAACTMGC AACCCCAATA TGGTGCTCCT CTTATATTGC ACAAATTGTA A 



in?weaSL4amilqrfsfelspsythapqsvitmqpqygaplilhkl 
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SEQ ID 103 D10A-AA4 



12 i SSSZSSSSEZS^^ acagtctgta 

m ATAACTATGC AACCCCAATA TGGTGCTCCT CTTATATTGC ACAAATTGTA A 



?NFA^AEAKMAMAMILQRFSFELSPSYTHAPQSVITMQPQVGAPLILHKL 
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SEQ ID 105 D70A-BA1 



il A^SL^ ATTTGCAATT 
"l GTGACTATTC ATCCTCAGTA TGGTGCTCCT CTGCTTATGC GCAGACTTTA A 



q™ea^avamilqkfsfelspsythapfaivtihpqygapllmrrl 
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SEQ ID 107 D70A-BA9 



" Z£252S£££^^ ATTTGCAATT 
181 GtSStC ATCCTCAGTA TGGTGCTCCT CTGCTTATGC GCAGACTTTA A 



QNFA1WEAKMAVAMILHKFSFELPSYTHAPFAIVTIHPQYGAPLLMRRL 
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SEQ ID 109 D70A-BD4 



U \ ^'cSTSS^S^ TCATCCACTA 
15! SaaTAT.* TGCTCAATTA ATTTTGTACA ASTTGTAG 

SEQ ID 110 

QNFAMLEAKMAMAMILKTYAFELSPSYAHAPHPLLLQPQYGAQLILYKL 



FI . 56 



SEQ ID 111 D181-AC5 



111 ^^m-n 

1 TATAGCATGG GGCTCAAGGC GATTCAAGCT AGCTTAGCTA ^c^cA 
£ tStA ACTTGCTACT GTGATTGAGC 

181 CAAGACTTTC ACCAAAACTT TACTCTGTTT OA 
YSMGL^IQASLANLLHGFNWSLPDNWTPEDLNMDEIFGLSTPKKFPIjATVIEPRLSPKLYSV 
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«FO id 113 D144-AH1 

° i tat agcttggggc tcaaggagat tcaagctagc ataatatgac tcctgaggac 

£ 35SS SSS55S S33SS S= SS- ™ 

181 attgagccaa gactttcacc aaaactttac tctgtttga 

YSLGLKEIQASLANLLHGFNWSLPDNMTPEDLNMDEIFGLSTPKKFPIATVIEPRIiSPKLYSV 
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SEQ ID 115 D34-65 



15 L»J«~"0-' 

1 CATAGCTTG GGGCTCAAGG TGACTCCTGA GGACCTCAAC 

J! SSE SSSS ££££ CACTTGCTAC TGTGATTGAG 

181 CCAAGACTTT CACCAAAACT TTACTCTGTT TGA 
HSLGLWIQAS1ANLLHGFNWSLPDNMTEEDLNMDEIFGLSTPKKFPLATVIEPRLSPKLYSV 
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SEQ ID 117 D35-BG2 



SEQ ID 118 

LCFPCLI S S YI LALNVNLYHNFLQI S PS I S Y 



FIG. 60 

241 ctttcagaaa tatcttaa 
^q^gLuatlvqcfemkrvseevvdltegkgltmpkpepi^ceardifhkvlseis 
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SEO ID 121 D58-AA1 

1 TTGGGCTTG GCAACGGTGC ATGTGAATTT GATGTTGGCC rrATTTTACT 
«i rrAATGATTC AAGAATTTGA ATGGTCCGCT TACCCGGAAA ATAGGAAAGT GGATTTTACT 

121 ga^?gg aSactgt ggtgatgaaa aatcctttaa gagctaaggt caagccaaga 
181 atgcaagtgg tgtaa 

LG^T VHVN LMLAPMI Q E FEW SAY P EN RKVD FTEKLE FTWMKN P LRAKVK P RMQW 



FIG. 62 

SEQ ID 123 D73A-AE10 



1 TSSss sssss ssss 

241 TGCCCCAGAG TTAACTCTAT TTGA 
YALAMLHLEYFVAN^ 



FIG. 63 

SEQ ID 125 D56A-AC12 



GGTC^VGCAAG TTGGACTTCT TAGAACAACC ATTTTCATCG CCTCATTACT GTCTGAATAT 
61 AAGCTGAAAC CTCGCTCACA CCAGAAACAA GTTGAACTCA CCGATTTAAA TCCAGCAAGT 

m cLtaaaagg cgaactgtta gtcgatgcga ttcctcgaaa gaaggcggca 



181 TTTTAA 



GQQVGLLRTT I FI ASLLS EYKLKPRSHQKQVELTDLNPASWLHS I KGELLVDAI PRKKAAF 



FIG. 64 



SEQ ID 127 D177-BF7 

12 ; sss sssss sssss I5= ««« »-»-• 

181 GATTTGGACG TGGAGGAAGC TGCTGGAATT ACTGTTAGAA GGAAGTTCCC CCTTTTAWA. 
241 GTCGCCACTC CATGCTCGTG A 

ITFAKFVNELALARLMraFDFSLPKGVK 



FIG. 65 

<iFO ID 129 D73A-AG3 

&JSy - rarrTATrC T ATAAACCATT TGATGCTCTT TATTGCGTTG 

12 \ tocacggctc tgattgmtt^caagaggcac aaaacggacg gctgtgatga tatcgcgtat 
is! ttgctccaaa ggatgattgt aaagtgttcc tttcacagag gtgcactcga 

241 TTCCCATCTT TTTCATGA 
QRYAINHLMLFIALFTALIDFKRHKTDGCDDIAYIPTIAPKDDCKVFLSQRCTRFPSFS 



FIG. 66 



<5fo id 131 d70a-aa12 

241 atcaaccttc tcgagagtaa 
msIg^lvLlaqllyhfdwklptgikprdldltelsgitiarkgdlylnatpyqpsre 



FIG. 67 

" ' 2 =S =ss= ssss 

181 ATGCAAGTGG TGTAA 
LGLATVHVNLMLART I QEFEWS AY P ENRKVDFTEKLEFTVVMKN PLRAKVKP RMQW 



FIG. 68 



SEO ID 135 D185-BG2 

121 AGAAATTGGA ATTTACTGTG GTGA 
SEQ ID 136 

LGLATVHWLMLARMIQEFEWSAYPENRKVDLLRNWNLLW 



FIG. 69 



SEO ID 137 D185-BE1 

1 ATCACATTT GCTAAGTTTG TGAATGAGCT AGCATTGGCA / ,^ mninirrs . 
61 AGATTAATGT TCCATTTTGA TTTCTCGCTA CCAAAAGGAG TTAAGCATGA GGATTTGGAC 
121 CTGCTGGAAT TACTGTTAGG AGGAAGTTCC CCCTTTTAGC CGTCGCCACT 

181 CCATGCTCGT GA 



it?^^eLuarlmfhfdfslpkgvkhedldveeaagitvrrkfpliavatpcs 



FIG. 70 



SEO ID 139 D185-BD2 

1 ATCACATTT GCTAAGTTTG TGAATGAGCT AGCATTGGCA ^^^r 
61 AGMTAATGT TCCATTTTGA TTTCTCGCTA CCAAAAGGAG TTAAGCATGC GGATTTGGAC 
GTGGAGGAAG CTGCTGGAAT TACTGTTAGA AGGAAGTTCC CCCTTTTAGC CGTCGCCACT 
181 CCATGCTCGT SA 



it^^eLuarmfhfdfslpkgvkhadldveeaagitvrrkfpllavatpcs 



FIG. 71 



«wr» TD 141 D176-BG2 

1 CA AAATTTTGCC ATGTTAGAAG CAAAGACTAC TTTGGCTATG TCAGTCCATA 
151 ATCCTACAAC GCTTCTCCTT TGAACTGTCT CCATCTTATG CACATGCTCC TCAGTCCATA 
ill ataa™c AACCCCAGTA TGGTGCTCCA CTTATTTTGC ATAAAATATA 
SEO ID 142 

QNFAMIjEAKTTLAMILQRFS FELS PS YAHAPQSI ITLQPQYGAPLI LHKI 



FIG. 72 



ID ^t£ G B^^^^^ AGTCAAAGCT TGACACAACA 

m gaSIaggcg g^at^ TCTGCACATT ttgaagcatt CCACCATTGT GATGAAACCA 
181 agatcttttt aa 
i^pil"tlgrlvqnfellpppgqskldttekggqfsi,hilkhstivmkprsf 



FIG. 73 



avrt TD 145 D176-BC3 m 

EE = is = esse ssss 

2 301 GCAAAATGGG AATGCATTTG CACTCGTGCA CTGTAGATTG TTGTAA 
QNFA^EAOTTIiAMILQRFSFELSPSYAHAPQSIITCNPSMVLHLFCIKYSLLLVSSVSFYVKHESKMLRLVELQNGNA 

FALVHCRLL 



FIG. 74 

SEQ ID 147 D176-BB3 



fil RTGGGGTTGC GAGCAGTTTC TTTGGCATTA GGTGCACTTA TTCAATGCTT TGACTGGCAA 

121 aSgaag cSSctt GGAGGAAAGC tataattcta gaatgactat gcagaacaag 
m ccttSg ttgtctgcac tccacgcgaa gatcttggcc agcttctatc ccaactctaa 

mmgl^avsLlgaliqcfdwqieeaesleesynsrmtmqnkplkwctpredlgqllsql 



FIG. 75 



NAME D89-AB1 
ORGANISM N I COT I ANA TABACUM 



SEQ. ID. NO. 149 m ^,,^rra raTTPAGTTT TCTAACTTAG TTGCATTCTT 

1 CTTCCTTCCT AAGTCCTAAC TAM^TGGA ^ATTCAGTTT TCT^^ 

61 GCTCTTTCTC TCCAGCWCT TTCTTCTATT CMMft^ cACCATTTGG CTGTGGCAGG 
121 GCCTCCTGGT CCATGGAAAT TACCTTTTAT £ TATGGTCCTC TTATGCATTT 

181 TCCACTTCCT CACCATGGCC TAAAAAATTT AGC^AAA ^tggcAA AAGAAGTACT 
241 ACAACTTGGA CAAATTCCTA CACTCATCAT ATCATCA^ GTCGCGGCCG ACATCATTCA 
301 AAAAACTCAC ^CTCGCTT TTGCCACTAG ACCAAAGCTT GT^^ 
361 CTACGACAGC AC GGACAT AG CATTTTCTCC TTTTTTAGCT CGATTCGCCA 

421 AATTT GCATA TTGGAACTCT TGAGTGCCAA ^GGTCAAA ^TTTTTAG 

- ~ ™ EEs =i =s = 

601 ATGTGGTGAC CAAGACAAAT TGA1CA1 1 1 1 i^i CATGATATTG ATGGTTCGAA 

661 ATTTAGTATT GCTGATTTTT TCCCTACATG SttWOTt CATGATATTG A ^ 
721 ATCTAAACTG GTGAAAGCAC ATCGTAAGAT ^GATGAAATT GTGAAGATTT 
781 GCACAAAAAG AACAGAGCAG ATGGCAAGAA GGGTAATGGT TCACAAATGA 
841 GATTGATGTA TTGTTAAGAG "AGAGAAAG TGGAGAAGTT CAAMTC^ 
901 CAATATCAAA TCAATATTAA TCGACATGTT C^CAG^A AGGCACAAGC 
t S£ tSaIgg caagctt GA AGGAGAAAAA ScAA CAGATTGATC TTGATGAGCT 
io 2 81 « AAGTTAGTAA = GAAAC CTTAAGAATG CACCCTCCAA TTCCTCTATT 
1141 AGTTCCTAGA GAATGTATGG AGGATACAAA GA™GGT TACAA ATGACCCCGA 
1201 AAGAGTCATA GTTAATGCAT GGGCAATCGG ACGAGATCCA £AAAGT ATCATCATCA 
1261 AAGCTTTATG CCAGAGAGAT TTGAGAATAG TTCTATTGAC TTTCTTG GTTTAGCTAA 
13 21 GTTTATACCA TTTGGTGCAG ^A*GGAT ^TGTCCGGGA ATGCTA^ ctaatggaca 
1381 TGTTGGACAA CCTTTAGCTC AGTTACTTTA TCACTTCbA GAAAGGATGA 
USi SS cS T T C C GTAGCAGAAA TAAAAAGCCG 

1561 GGGCAAACAG AAAAAA 

SEQ. ID. NO. 150 tttkkwKTRKL NLPPGPWKLP FIGSLHHLAV AGPLPHHGLK 

i =s =| s = r= 

£ SEE =1 m = ™ Si 

JS SEEK « SS 

481 ESPGISATRK DDLVLIATPY DSY 



FIG. 76 



NAME D89-AD2 
ORGANISM N I COT I ANA TABACUM 



. HSSS = SSSS S5SSS 

; iili = s=£ sssss sssss = 

~ 5SESS SEE =S 3S3S SSSK 
5KSS SSSSS S5E SSS = SEES 
SS SEES SSSS SSSSS = 

Es sssss ssss tsrsss = ssss 
Si ssss; = ss = 

"EE = SSSE =£2S 5KS5 

infl CAAGCTGAAG TAAGGCAAGC TTTGAAGGAG AAAAAAGGTT TTCAACAGAT TGATCTTGAT 

E= iss ekks s= = ess 
EE ss sssss sssss = = 

All Stotta TACCATTTGG TGCAGGAAGA aggatttgtc cgggaatgct atttggttta 

™ = = SEES 5SSSS skk 

«W GATGATCTTG TTTTGATTGC CACTCCTTAT GATTCTTATT AAGCAGTAGC AGAAATAAAA 
1561 AGCCGGGGCA AACAGAAAAA A 



SEQ. ID. NO. 152 



1 MEIOFSNLVA FLLFLSSIFL LFKKWKTRKL NLPPGPWKLP FIGSLHHLAV AGPLPHHGLK 
fil N^RYGPLM HLQLGQIPTL IISSPQMAKE VLKTHDLAFA TRPKLWADI IHYDSTDIAF 
121 RKICILELLS AKMVKFFSSI RQDELSKMLS SIRTTPNLTV HLTDKIFWFT 

ill IIwSg KICGDQDKLI IFMREIISIA GGFSIADFFP TWKMIHDIDG sksklvkahr 
24 KIDEILGNW DEHKKNRADG KKGNGEFGGE DLIDVLLRVR ESGEVQIPIT ^NIKSILID 
Hi MFSAGSETSS TTIIWALAEM MKKPSVLAKA QAEVRQALKE KKGFQQIDLD ELKYLKLVIK 
111 LLVPRECMED TKIDGYNIPF KTRVIVNAWA IGRDPESWDD PESFMPERFE 

IA NSSIDFLGNH HQFIPFGAGR RICPGMLFGL ANVGQPLAQL LYHFDWKLPN GQSHENFDMT 
481 ESPGISATRK DDLVLIATPY DSY 



FIG. 77 



NAME D90A-BB3 
ORGANISM N I COT I ANA TABACUM 

SEQ. ID. ^°^j^ GCAGT TTGAAGATAC CAACTAACCA AAATGCAGTT CTTCAGCTTG GTTTCCATTT 
61 TCCTATTTCT ATCTTTTCTC TTTTTGTTAA GGAAATGGAA GAACTCGAAT AGCCAAAGGA 
121 AAAAATTGCC ACCAGGTCCA TGGAAACTAC CAATACTAGG AAGTATGCTT CATATGGTTG 
181 GTGGACTACC ACACCATGTC CTTAGAGATT TAGCCAAAAA ATATGGACCG CTTATGCACC 
241 TTCAATTAGG TGAAGTTTCT GCAGTTGTGG TTACTTCTCC TGATATGGCA AAAGAAGTAC 
301 TAAAAACTCA TGACATCGCT TTCGCGTCTA GGCCTAGCCT TTTGGCCCCG GAGATTGTCT 
361 GTTACAATAG GTCTGATCTT GCGTTTTGCC CCTATGGCGA TTATTGGAGA CAAATGCGTA 
421 AAATATGTGT CTTGGAAGTG CTCAGTGCCA AGAATGTTCG GACATATAGC TCTATTAGGC 
481 GCGATGAAGT TCTTCGTCTC CTTAATTTTA TCCGGTCATC TTCTGGTGAG CCTGTTAATA 
541 TTACGGAAAG GATCTTTTTG TTCACAAGCT CCATGACATG TAGATCAGCG TTTGGGCAAG 
601 TATTCAAGGA GCAAGACAAA TTTATACAAC TAATTAAAGA AGTTATACTC TTAGCAGGAG 
661 GGTTTGATGT GGCTGACATA TTCCCTTCAT ACAAGTCTCT TCATGTGCTC AGTGGAATGA 
721 AGGGTAAGAT TATGAATGCA CACCATAAGG TAGATGCTAT TGTTGAGAAT GTCATCAACG 
781 AGCACAAGAA AAATCTTGCA ATTGGGAAAA CTAATGGAGC GTTAGGAGGT GAAGATTTAA 
841 TTGATGTTCT TCTAAAACTT ATGAATGATG GAGGCCTTCA ATTTCCTATC ACCAACGACA 
901 ACATCAAAGC TATAATCTTT GACATGTTTG CTGCTGGAAC AGAGACTTCA TCGTCAACAA 
961 TTGTGTGGGC TATGGTGGAA ATGGTGAAAA ATCCAACTGT ATTTGCGAAA GCTCAAGCAG 
1021 AAGTAAGAGA TGCATTTAGA GAAAAAGAAA CTTTTGATGA AAATGATGTG GAG GAGCTAA 
1081 ACTATCTAAA GTTAGTCATT AAAGAAACTC TAAGACTTCA TCCACCGGTT CCACTTTTGC 
1141 TCCCAAGAGA ATGTAGGGAA GAGACAAATA TAAACGGCTA CACTATTCCT GTAAAGACCA 
1201 AAGTCATGGT TAATGTTTGG GCATTGGGAA GAGATCCAAA ATATTGGGAT GATGCAGAAA 
1261 CTTTTAAGCC AGAGAGATTT GAGCAGTGCT CTAAGGATTT TGTTGGTAAT AATTTTGAAT 
1321 ATCTTCCATT TGGTGGTGGA AGGAGGATTT GTCCAGGGAT TTCGTTTGGT TTAGCTAATG 
1381 CTTATTTGCC ATTGGCTCAA TTACTTTATC ACTTTGATTG GGAACTCCCC ACTGGAATCA 
1441 AACCAAGCGA CTTGGACTTG ACTGAGTTGG TTGGAGTAAC TGCCGCTAGA AAAAGTGACC 
1501 TTTACTTGGT TGCGACTCCT TATCAACCTC CTCAAAAC 



1 MQFFSLVSIF LFLSFLFLLR KWKNSNSQRK KLPPGPWKLP ILGSMLHMVG GLPHHVLRDL 
61 AKKYGPLMHL QLGEVSAVW TSPDMAKEVL KTHDIAFASR PSLLAPEIVC YNRSDLAFCP 
121 YGDYWRQMRK ICVLEVLSAK NVRTYSSIRR DEVLRLLNFI RSSSGEPVNI TERIFLFTSS 
181 MTCRSAFGQV FKEQDKFIQL IKEVILLAGG FDVADIFPSY KSLHVLSGMK GKIMNAHHKV 
241 DAIVENVINE HKKNLAIGKT NGALGGEDLI DVLLKLMNDG GLQFPITNDN I KAI I FDMFA 
301 AGTETSSSTI VWAMVEMVKN PTVFAKAQAE VRDAFREKET FDENDVEELN YLKLVIKETL 
361 RLHPPVPLLL PRECREETNI NGYTIPVKTK VMVNVWALGR DPKYWDDAET FKPERFEQCS 
421 KDFVGNNFEY LPFGGGRRIC PGISFGLANA YLPLAQLLYH FDWELPTGIK PSDLDLTELV 
481 GVTAARKSDL YLVATPYQPP QN 
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NAME D95-AG1 

ORGANISM N I COT I AN A TABACUM 

SEQ * ^ 1°AAAAGATGTC TTCATTTTCC ACATCTTCTG CCACTTCTAA TTCCAAACTT CCAGTTCGAG 
61 AAATCCCAGG AGACTATGGT TTCCCCTTTT TTGGAGCCAT AAAAGATAGA TATGACTACT 
121 TCTACAACCT CGGCACAGAC GAATTCTTTC TTACCAAAAT GCAAAAATAC AACTCTACTG 
181 TCTTTAGAAC CAACATGCCA CCAGGTCCAT TCATTGCTAA AAATCCCAAA GTAATTGTTC 
241 TCCTCGATGC CAAAACATTT CCCGTTCTTT TCGACAACTC TAAAGTCGAA AAAATGAACG 
301 TTCTTGATGG CACGTACGTG CCATCTACTG ATTTCTATGG CGGATATCGC CCGTGTGCTT 
361 ATCTTGATCC TTCTGAGTCA ACTCATGCCA CACTTAAAGG GTTCTTTTTA TCTTTAATCT 
421 CCCAGCTTCA TAATCAATTT ATTCCTTTAT TTAGAACCTC AATTTCTGGT CTTTTCGCAA 
481 ATCTTGAGAA TGAGATTTCC CAAAATGGCA AAGCGAACTT CAACAATATC AGCGACATTA 
541 TGTCATTCGA TTTTGTTTTT CGTTTGTTAT GTGACAAGAC CAGTCCCCAT GACACAAATC 
601 TTGGCTCTAA TGGACCAAAA CTCTTTGATA TATGGCTGTT GCCTCAACTT GCTCCATTGT 
661 TTAGTCTAGG TCTAAAATTT GTGCCGAACT TTCTGGAAGA TTTAATGTTG CATACTTTTC 
721 CCTTGCCATT TTTTCTAGTG AGATCGAATT ACCAGAAGCT TTATGATGCT TTTAGCAAGC 
781 ATGCCGAAAG TACACTGAAT GAAGCAGAGA AGAATGGGAT CAAAAGAGAC GAAGCATGCC 
841 ACAACTTAGT TTTTCTTGCA GGTTTCAATG CTTATGGTGG GATGAAAGTT TTATTCCCTG 
901 CACTGATAAA GTGGGTCGCC AATGGAGGAA AGAGTTTACA CACTCGGCTG GCAAATGAAA 
961 TCAGGACAAT TATCAAAGAA GAATGTGGGA CCATAACTCT ATCAGCAATC AACAAGATGA 
1021 GT TTAGT AAA ATCAGTAGTG TATGAAGTAT TAAGAATTGA ACCTCCAGTT CCATTCCAAT 
1081 ATGGTAAAGC CAAAGAAGAT ATCATAATCC AAAGCCATGA TTCAACTTTC TTAGTCAAGA 
1141 AAGGTGAAAT GATCTTTGGA TATCAGCCTT TTGCTACAAA AGATCCAAAG ATTTTTGACA 
1201 AACCAGAGGA GTTTATTCCG GAGAGGTTCA TGGCCGAAGG GGAAAAATTA TTAAAGTATG 
1261 TGTATTGGTC AAATGCAAGA GAGACAGATG ATCCAACGGT GGACAACAAA CAATGCCCAG 
1321 CGAAAAATCT TGTCGTGCTT TTGTGCAGGT TGATGTTGGT GGAGGTTTTC ATGCGTTACG 
1381 ACACATTCAC AGTGGAGTCA ACAAAGCTCT TTCTTGGGTC ATCAGTAACG TTCACGACTC 
1441 TGGAAAAAGC GACATGAGTT TCAGATATCT TAATTGTAGG CTGCAAATAA TAATGTGGTC 
1501 ATTCTGCAAA TTATTGTACT TGTGCTGATG 

SEQ. ID. N0^156 tssat SNSKLpvREI pGDYGFPFFG AIKDRYDYFY NLGTDEFFLT KMQKYNSTVF 
61 RTNMPPGPFI AKNPKVIVLL DAKTFPVLFD NSKVEKMNVL DGTYVPSTDF YGGYRPCAYL 
121 DPSESTHATL KGFFLSLISQ LHNQFIPLFR TSISGLFANL ENEISQNGKA NFNNISDIMS 
181 FDFVFRLLCD KTSPHDTNLG SNGPKLFDIW LLPQLAPLFS LGLKFVPNFL EDLMLHTFPL 
241 PFFLVRSNYQ KLYDAFSKHA ESTLNEAEKN GIKRDEACHN LVFLAGFNAY GGMKVLFPAL 
301 IKWVANGGKS LHTRLANEIR TIIKEECGTI TLSAINKMSL VKSWYEVLR IEPPVPFQYG 
361 KAKEDIIIQS HDSTFLVKKG EMI FGYQPFA TKDPKIFDKP EEFIPERFMA EGEKLLKYVY 
421 WSNARETDDP TVDNKQCPAK NLWLLCRLM LVEVFMRYDT FTVESTKLFL GSSVTFTTLE 
481 KAT 



FIG. 79 



NAME D96-AB6 

ORGANISM N I COT I ANA TABACUM 

SEQ. ID. NO. 157 

1 CCAAAAATGG AGCTTCAATC TTCTCCTTTC AATTTAATTT CTTTGTTCCT CTTCTTTTCT 
61 TTTCATTTTA TTCTAGTGAA GAAATGGAAT GCCAAAATCC CAAAGTTACC TCCAGGTCCG 
121 TGGAGGCTTC CCTTTATTGG AAGCCTCCAT CACTTGAAGG GAAAACTTCC ACAC CAT AAT 
181 CTTAGAGATC TAGCGCGAAA ATATGGGCCT CTCATGTACT TACAACTCGG AGAAATTCCT 
241 GTAGTTGTAA TATCTTCGCC ACGTGTAGCA AAAGCTGTAC TAAAAACTCA TGATCTCGCT 
301 TTTGCAACTA GACCACGATT CATGTCCTCA GACATTGTGT TTTACAAAAG CAGGGACATC 
361 TCTTTTGCCC CATTTGGTGA TTACTGGAGA CAGATGCGTA AAATATTGAC TCAGGAACTC 
421 CTGAGTAACA AGATGCTCAA GTCATATAGC TTAATCCGAA AGGATGAGCT CTCGAAGCTC 
481 CTCTCATCGA TTCGTTTGGA AACAGGTTCT GCAGTGAACA TAAATGAAAA GCTTCTCTGG 
541 TTTACGAGCT GCATGACCTG TAGATTAGCC TTTGGAAAAA TATGCAATGA TCGGGATGAG 
601 TTGATCATGC TAATTAGGGA GATATTAACA TTATCAGGAG GATTTGATGT GGGTGATTTG 
661 TTCCCTTCCT GGAAATTACT TCATAATATG AGCAACATGA AAGCTAGGTT GACGAATGTA 
721 CACCACAAGT ATGATTTAGT TATGGAGAAC ATCATCAATG AGCACCAAGA GAATCATGCA 
781 GCAGGGATAA AGGGTAACAA CGAGTTTGGT GGCGAAGATA TGATCGATGC TCTACTGAGG 
841 GCTAAGGAGA ATAATGAGCT TCAATTTCCT ATCGAAAATG ACAACATGAA AGCAGTAATT 
901 CTGGACTTGT TTATTGCTGG AACTGAAACT TCATATACTG CAATTATATG GGCACTATCA 
961 GAATTGATGA AGCACCCAAG TGTGATGGCC AAGGCACAAG CTGAAGTGAG AAAAGTCTTC 
1021 AAAGAAAATG AAAATTTCGA CGAAAATGAT CTTGACAAGT TGCCATACTT AAAATCAGTG 
1081 ATTAAAGAAA CACTAAGGAT GCACCCTCCA GTTCCTTTGT TAGGGCCTAG AGAATGCAGG 
1141 GACCAAACAG AGATCGATGG CTACACTGTA CCTATTAAAG CTAGAGTTAT GGTTAATGCT 
1201 TGGGCGATAG GAAGAGATCC TGAAAGTTGG GAAGATCCTG AAAGTTTCAA ACCGGAGCGA 
1261 TTTGAAAATA CTTCTGTTGA TCTTACAGGA AATCACTAT C AGTTCATTCC TTTCGGTTCA 
1321 GGAAGAAGAA TGTGTCCAGG AATGTCGTTT GGTTTAGTTA ACACAGGGCA TCCTTTAGCC 
1381 CAGTTGCTCT ATTGCTTTGA CTGGAAACTC CCTGACAAGG TTAATGCAAA TGATTTTCGC 
1441 ACTACTGAAA CAAGTAGAGT TTTTGCAGCA AGCAAAGATG ACCTCTACTT GATTCCCACA 
1501 AATCACAGGG AGCAAGAATA GCTTAATTTA ATGGAGTTCT TGGAAGAATT AAAGAAGAAG 
1561 GGCTATATAG GTGAGATTTT TTGTATGGTT GCA 

SEQ. ID. NO. 158 

1 MELQSSPFNL ISLFLFFSFH FILVKKWNAK IPKLPPGPWR LPFIGSLHHL KGKLPHHNLR 
61 DLARKYGPLM YLQLGEI PW VISSPRVAKA VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 
121 APFGDYWRQM RKILTQELLS NKMLKSYSLI RKDELSKLLS SIRLETGSAV NINEKLLWFT 
181 SCMTCRLAFG KICNDRDELI MLIREILTLS GGFDVGDLFP SWKLLHNMSN MKARLTNVHH 
241 KYDLVMENII NEHQENHAAG IKGNNEFGGE DMI DALLRAK ENNELQFPIE NDNMKAVILD 
301 LFIAGTETSY TAIIWALSEL MKHPSVMAKA QAEVRKVFKE NENFDENDLD KLPYLKSVIK 
361 ETLRMHPPVP LLGPRECRDQ TEIDGYTVPI KARVMVNAWA IGRDPESWED PESFKPERFE 
421 NTSVDLTGNH YQFIPFGSGR RMCPGMSFGL VNTGHPLAQL LYCFDWKLPD KVNANDFRTT 
481 ETSRVFAASK DDLYLIPTNH REQE 
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NAME D96-AC2 

ORGANISM NI COT I ANA TABACUM 

SEQ. ID. NO. 159 

1 CTTCTTCCAA AAATGGAGCT TCAATCTTCT CCTTTCAATT TAATTTCTTT GTTCCTCTTC 
61 TTTTCTTTTC TTTTTATTCT AGTGAAGAAA TGGAATGCCA AAATCCCAAA GTTACCTCCA 
121 GGTCCGTGGA GGCTTCCCTT TATTGGAAGC CTCCATCACT TGAAGGGAAA ACTTCCACAC 
181 CATAATCTTA GAGATCTAGC GCGAAAATAT GGACCTCTGA TGTACTTACA ACTCGGAGAA 
241 ATTCCTGTAG TTGTAATATC TTCGCCACGT GTAGCAAAAG CTGTACTAAA AACTCATGAT 
301 CTCGCTTTTG CAACTAGACC ACGATTCATG TCCTCAGACA TTGTGTTTTA CAAAAGCAGG 
361 GACATCTCTT TTGCCCCATT TGGTGATTAC TGGAGACAGA TGCGTAAAAT ATTGACTCAG 
421 GAACTCCTGA GTAACAAGAT GCTCAAGTCA TATAGCTTAA TCCGAAAGGA TGAGCTCTCG 
481 AAGCTCCTCT CATCGATTCG TTTGGAAACA GGTTCTGCAG TGAACATAAA TGAAAAGCTT 
541 CTCTGGTTTA CGAGCTGCAT GACCTGTAGA TTAGCCTTTG GAAAAATATG CAATGATCGG 
601 GATGAGTTGA TCATGCTAAT TAGGGAGATA TTAACATTAT CAGGAGGATT TGATGTGGGT 
661 GATTTGTTCC CTTCCTGGAA ATTACTTCAT AATATGAGCA ACATGAAAGC TAGGTTGACG 
721 AATGTACACC ACAAGTATGA TTTAGTTATG GAGAACATCA TCAATGAGCA CCAAGAGAAT 
781 CATGCAGCAG GGATAAAGGG TAACAACGAG TTTGGTGGCG AAGATATGAT CGATGCTCTA 
841 CTGAGGGCTA AGGAGAATAA TGAGCTTCAA TTTCCTATCG AAAATGACAA CATGAAAGCA 
901 GTAATTCTGG ACTTGTTTAT TGCTGGAACT GAAACTTCAT ATACTGCAAT TATATGGGCA 
961 CTATCAGAAT TGATGAAGCA CCCAAGTGTG ATGGCCAAGG CACAAGCTGA AGTGAGAAAA 
1021 GTCTTCAAAG AAAATGAAAA TTTCGACGAA AATGATCTTG ACAAGTTGCC ATACTTAAAA 
1081 TCAGTGATTA AAGAAACACT AAGGATGCAC CCTCCAGTTC CTTTGTTAGG GCCTAGAGAA 
1141 TGCAGGGACC AAACAGAGAT CGATGGCTAC ACTGTACCTA TTAAAGCTAG AGTTATGGTT 
12 01 AATGCTTGGG CGATAGGAAG AGATCCTGAA AGTTGGGAAG ATCCTGAAAG TTTCAAACCG 
1261 GAGCGATTTG AAAATACTTC TGTTGATCTT ACAGGAAATC ACTATCAGTT CATTCCTTTC 
1321 GGTTCAGGAA GAAGAATGTG TCCAGGAATG TCGTTTGGTT TAGTTAACAC AGGGCATCCT 
1381 TTAGCCCAGT TGCTCTATTG CTTTGACTGG AAACTCCCTG ACAAGGTTAA TGCAAATGAT 
1441 TTTCGCACTA CTGAAACAAG TAGAGTTTTT GCAGCAAGCA AAGATGACCT CTACTTGATT 
1501 CCCACAAATC ACAGGGAGCA AGAATAGCTT AATTTAATGG AGTTCTTGGA AGAATTAAAG 
1561 AAGAAGGGCT ATATAGGTGA GATTTTTTGT ATGGTTGCA 

SEQ. ID. NO. 160 

1 MELQSSPFNL ISLFLFFSFL FILVKKWNAK IPKLPPGPWR LPFIGSLHHL KGKLPHHNLR 
61 DLARKYGPLM YLQLGEIPW VISSPRVAKA VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 
121 APFGDYWRQM RKILTQELLS NKMLKSYSLI RKDELSKLLS SIRLETGSAV NINEKLLWFT 
181 SCMTCRLAFG KICNDRDELI MLIREILTLS GGFDVGDLFP SWKLLHNMSN MKARLTNVHH 
241 KYDLVMENII NEHQENHAAG IKGNNEFGGE DMIDALLRAK ENNELQFPIE NDNMKAVILD 
301 LFIAGTETSY TAI IWALSEL MKHPSVMAKA QAEVRKVFKE NENFDENDLD KLPYLKSVIK 
361 ETLRMHPPVP LLGPRECRDQ TEIDGYTVPI KARVMVNAWA IGRDPESWED PESFKPERFE 
421 NTSVDLTGNH YQFIPFGSGR RMCPGMSFGL VNTGHPLAQL LYCFDWKLPD KVNANDFRTT 
481 ETSRVFAASK DDLYLIPTNH REQE 
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NAME D98-AA1 

ORGANISM N I COT I ANA TABACUM 

SEQ. ID. NO. 161 

1 CTTTCTTTCT TGTACCGAGA TGGAGTTTCA ACACTTGGTT TCGTTCTTGC TATTCATCTC 
61 CTTCATCTTT CTTCTAATTC AAAAATGGAG GAAATCGAAA AAGCTGCCAC CTGGTCCGTG 
121 GAGGCTACCT ATTATTGGAA GTGTGCATCA CTTGACAAGT GGAGTACCAC ATCGAGTTCT 
181 CAGAAATTTA TCACAAAAAT TTGGCCCGAT CATGTACTTG CAGCTCGGGG AAGTTCCCAC 
241 AGTAGTTGTA TCCTCCCCAC ACATGGCCAA ACAAATTTTA AAAACTCATG ACCTCGCTTT 
301 TGCATCTAGG CCAGAAATCA TGATGGGAAA AATTATTTGC TACGATTGTA AGGACATTGC 
361 CTTTTCCCCG TATGGTGATT ATTGGAGACA TATGCGTAAA TTGAGCACCT TGGAACTACT 
421 TAGTGCCAAG ATGGTCAAGT CCTTCAGTCC AATTCGTCAA GATGAGCTCT CAAGTCTCCT 
481 ATCATCCATT GAATCAATGG GAAATTTGCC AATCAACTTA GTAGAAAAAC TTTTATGGTT 
541 TATGAATGCC GCGACATGTA GGTCAGCATT TGGGAAAGTG TGTAAAGATC AAAAAGAGTT 
601 GATAACATTG ATTCAACGAG CAGAATCATT ATCTGGTGGA TTCGAGCTGG CTGATTTGTT 
661 CCCTTCGAAG AAGTTTCTAC ATGGTATTAG TGGGATGCGA TCTAAACTAA TGGAAGCTCG 
721 TAACAAGATA GACGCAGTCT TGGACAACAT TATCAATGTG CACAGAGAGA ATCGGGCAAA 
781 TGGAAATAGT TGTAATGGTG AGTCTGGAAC TGTAGATTTC ATCGATGTTT TTCTAAGGGT 
841 CATGGAGAGT GGCGAATTAC CATTTCCGAT AGAAAATGAC AACATCAAAG CAGTTATTCT 
901 TGACATGTTC GTAGCAGGAT CTGACACATC ATCTTCAACC GTTATTTGGG CATTAACAGA 
961 AATGATGAAG AATCCAAAAG TCATGGCTAA AGCACAAGCT GAAGT GAGAG AAGCTTTTAA 
1021 AGGAAAGAAA GCATGTGATG AGGATACTGA TCTTGAAAAG CTTCATTACC TAAATTTAGT 
1081 GATCAAAGAG ACACTCCGAT TACACCCTCC AACTCCTCTA CTTGTCCCGC GAGAATGCAG 
1141 GGAGGAAACA GAGATAGAAG GATTCACTAT ACCATTGAAA AGCAAAGTCT TGGTTAACGT 
1201 ATGGGCAATT GGAAGAGATC CCGAGAATTG GAAAAATCCT GAATGTTTTA TACCAGAGAG 
1261 ATTCGAAAAT AGTTCTATTG AGTTTACTGG AAATCATTTT CAACTTCTTC CGTTTGGCGC 
1321 TGGAAGACGA ATTTGTCCAG GAATGCAATT TGGTTTGGCT CTTGTTACTC TGCCATTGGC 
1381 TCATTTGCTT CACAATTTTG ATTGGAAACT TCCCGAAGGA ATTAAT GCAA GGGATTTGGA 
1441 CATGACAGAG GCAAATGGGA TATCTGCTAG AAGAGAAAAA GATCTTTACT TGATTGCTAC 
1501 TCCTTATGTA TCACCTCTTG ATTAACTCTG AAATTTTGCT TTAATGCTGC TTGCTTGCTT 
1561 CACT 



SEQ. ID. NO. 162 

1 MEFQHLVSFL LFISFIFLLI QKWRKSKKLP PGPWRLPIIG SVHHLTSGVP HRVLRNLSQK 
61 FGPIMYLQLG EVPTVWSSP HMAKQILKTH DLAFASRPEI MMGKIICYDC KDIAFSPYGD 
121 YWRHMRKLST LELLSAKMVK SFSPIRQDEL SSLLSSIESM GNLPINLVEK LLWFMNAATC 
181 RSAFGKVCKD QKELITLIQR AESLSGGFEL ADLFPSKKFL HGISGMRSKL MEARNKIDAV 
241 LDNIINVHRE NRANGNSCNG ESGTVDFIDV FLRVMESGEL PFPIENDNIK AVILDMFVAG 
301 SDTSSSTVIW ALTEMMKNPK VMAKAQAEVR EAFKGKKACD EDTDLEKLHY LNLVI KETLR 
361 LHPPTPLLVP RECREETEIE GFTIPLKSKV LVNVWAIGRD PENWKNPECF IPERFENSSI 
421 EFTGNHFQLL PFGAGRRICP GMQFGLALVT LPLAHLLHNF DWKLPEGINA RDLDMTEANG 
4 81 I SARREKDLY LIATPYVSPL D 



FIG. 82 



NAME D98-AG1 

ORGANISM NICOTIANA TABACUM 

SEQ. ID. NO. 163 CCGAGATGGA GTTTCAACAC TTGGTTTCGT TCTTGCTATT CATCTCCTTC 

61 ATCTTTCTTC TAATTCAAAA ATGGAGGAAA TCGAAAAAGC TGCCACCTGG TCCGTGGAGG 
121 CTACCTATTA TTGGAAGTGT GCATCACTTG ACAAGTGGAG TACCACATCG AGTT CTCAGA 
hi Stttatcac AAAAATTTGG CCCGATCATG TACTTGCAGC TCGGGGAAGT tcccacagta 
241 GTTGTATCCT CCCCACACAT GGCCAAACAA ATTTTAAAAA CTCATGACCT CGCTTTTGCA 
301 TCTAGGCCAG AAATCATGAT GGGAAAAATT ATTTGCTACG ATTGTAAGGA CATTGCCTTT 
111 ?CCCCGTMG GTGATTATTG GAGACATATG CGTAAATTGA GCACCTTGGA ACTACTTAGT 
421 GCCAAGATGG TCAAGTCCTT CAGTCCAATT CGTCAAGATG AGCTCTCAAG TCTCCTATCA 
481 TCCATTGAAT CAATGGGAAA TTTGCCAATC AACTTAGTAG AAAAACTTTT ATGGTTTATG 
541 AATGCCGCGA CATGTAGGTC AGCATTTGGG AAAGTGTGTA AAGATCAAAA AGAGTTGATA 
601 ACATTGATTC AACGAGCAGA ATCATTATCT GGTGGATTCG AGCTGGCTGA TTTGTTCCCT 
661 TCGAAGAAGT TTCTACATGG TATTAGTGGG ATGCGATCTA AACTAATGGA AGCTCGTAAC 
i2i Iaga^agacg cagtcttgga caacattatc AATGTGCACA GAGAGAATCG GGCAAATGGA 
781 aIS^TA ATGGTGAGTC TGGAACTGTA GATTTCATCG ATGTTTTTCT AAGGGTCATG 

Hi Stggcg aattaccatt tccgatagaa aatgacaaca tcaaagcagt tattcttgac 

901 ATGTTCGTAG CAGGATCTGA CACATCATCT TCAACCGTTA TTTGGGCATT AACAGAAACG 
III CAAAAGTCAT GGCTAAAGCA CAAGCTGAAG TGAGAGAAGC TTTTAAAGGA 

1021 AAGAAAGCAT GTGATGAGGA TACTGATCTT GAAAAGCATC ATTACCTAAA TTTAGTGATC 

"m S TCCGATTACA ccctccaact cctctacttg tcccgcgaga atgcagggag 
1141 gaaacagaga tagaaggatt cactatacca ttgaaaagca aagtcttggt taacgtatgg 
1201 gcaattggaa gagatcccga gaattggaaa aatcctgaat gttttatacc agagagattc 

1261 GAAAATAGTT CTATTGAGTT TACTGGAAAT CATTTTCAAC TTCTTCCGTT TGGCGCTGGA 
W2l a^aSttt GTCCAGGAAT GCAATTTGGT TTGGCTCTTG TTACTCTGCC ATTGGCTCAT 
llll T^C^ACA ATTTTGATTG GAAACTTCCC GAAGGAATTA ATGCAAGGGA TTTGGACATG 
1441 S ATGGGATATC TGCTAGAAGA GAAAAAGATC TTTACTTGAT TGCTACTCCT 
1501 TATGTATCAC CTCTTGATTA ACTCTGAAAT TTTGCTTTAA TGCTGCTTGC TTGCTTCACT 

SEQ. ID. NO^ 164 LFISFIFLLI QKWRKSKKLP PGPWRLPIIG SVHHLTSGVP HRVLRNLSQK 
6 1 FGPIMYLQLG EVPTVWSSP HMAKQILKTH DLAFASRPEI MMGKIICYDC KDIAFSPYGD 
121 YWRHMRKLST LELLSAKMVK SFSPIRQDEL SSLLSSIESM GNLPINLVEK LLWFMNAATC 
181 RSAFGKVCKD QKELITLIQR AESLSGGFEL ADLFPSKKFL HGISGMRSKL MEMHnDW 
241 LDNIINVHRE NRANGNSCNG ESGTVDFIDV FLRVMESGEL PFPIENDNIK AVILDMFVAG 
301 SDTSSSTVIW ALTETMKNPK VMAKAQAEVR EAFKGKKACD EDTDLEKHHY ^NLVIKETLR 
361 LHPPTPLLVP RECREETEIE GFTIPLKSKV LVNVWAIGRD PENWKNPECF IPERFENSSI 
421 EFTGNHFQLL PFGAGRRICP GMQFGLALVT LPLAHLLHNF DWKLPEGINA RDLDMTEANG 
481 ISARREKDLY LIATPYVSPL D 



FIG. 83 



NAME D100-BE2 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 165 

1 CAAAAACAAA ATTCCAATGG TTAACATGTT CACTCCAATT ATATACGCTC CTCTCCTTTT 
61 AGCTTTTTAC ATTATCACAA AACATTTCTT ACGCAAACTC AGAAATAATC CACCAGCTCC 
121 ATTTCTTACT TTCCCCTTTA TTGGCCATCT TTATCTCTTC AAAAAACCAC TTCAACGTAC 
181 CTTAGCCAAA ATCTCCGAAC GTTATGGCTC TGTTCTTCTA CTCGAATTCG GTTCACGAAA 
241 AGTACTTTTG GTTTCTTCAC CATCTGCAGC TGAAGAATGC TTAACAAAAA AC GAT ATT AT 
301 TTTCGCGAAT CGTCCTCTTT TGATGGCTGG AAAACATCTT GGATATAATT TTACATCTTT 
361 GGCTTGGAGT TCGTACGGAG ATCATTGGAG AAATCTGCGA AGGATTACTT CAGTTGAGAT 
421 GTTTTCGACT CATCGTCTTC AAATGCTACA TGGGATTCGT ATTGATGAAG TGAAATCTAT 
481 GGTTAAGAGG CTCAATTCCT CTGCCATAGC TGAAAAATCT GTGGATATGA AGTCTATGTT 
541 TTTTGAGCTG ATGCTCAATG TT AT GAT GAG GACAATTGCT GGAAAAAGAT ATTACGGTGA 
601 GAATGTGGAG GACATTGAGG AAGCTACGAG ATTCAAAGGT TTGGTGCAAG AGACTTTCAG 
661 GATTGGCGGG GCGACGAATA TTGGCGACTT TTTGCCGGCG TTGAAGTTAT TGGTGAGGAA 
721 ATTGGAGAAA AGTTTAATTG TGTTGCAAGA GAACAGAGAT GAGTTTATGC AGGAATTAAT 
781 TAAAGATTGC AGAAAAAGAA TGGAGAAAGA AGGTACTGTT ACTGATTCAG AAATT GAAGG 
841 GAACAAGAAA TGTTTAATTG AAGTTTTGTT AACACTACAA GAAAATGAAC CGGAATACTA 
901 CAAAGATGAA ATCATCAGAA GCCTTATGCT TGTTCTATTA TCAGCTGGTA CAGATACTTC 
961 AGTTGGGACA ATGGAATGGG CTTTATCATT AATGTTAAAC CACCCTGAAA CTCTGAAGAA 
1021 AGCACAAGCT GAAATTGATG AACATATAGG ACATGAACGT TTAGTGGACG AGTCGGACAT 
1081 CAACAACCTA CCTTACCTAC GTTGTATAAT CAACGAGACA TTCCGAATGT ACCCTGCAGG 
1141 ACCACTACTA GTCCCACACG AGTCGTCAGA GGAAACCACC GTAGGAGGCT ACCGTGTACC 
1201 CGGAGGAACC ATGTTACTTG TGAATTTGTG GGCAATTCAC AATGATCCAA AGCTATGGGA 
1261 TGAACCAAGA AAGTTTAAAC CAGAAAGATT TCAAGGACTA GATGGTGTTA GAGATGGTTA 
1321 CAAAATGATG CCTTTTGGTT CTGGACGAAG GAGTTGTCCT GGAGAAGGAT TGGCTGTTCG 
1381 AATGGTTGCC TTGTCATTGG GATGTATTAT TCAATGTTTT GATTGGCAAC GAATCGGCGA 
1441 AGAATTGGTT GATATGACTG AAGGAACTGG ACTTACTTTG CCTAAAGCTC AACCTTTGGT 
1501 GGCCAAGTGT AGCCCACGAC CTAAAATGGC TAATCTTCTC TCTCAGATTT GA 

SEQ. ID. NO. 166 

1 MVNMFTPIIY APLLLAFYI I TKHFLRKLRN NPPAPFLTFP FIGHLYLFKK PLQRTLAKIS 
61 ERYGSVLLLE FGSRKVLLVS SPSAAEECLT KNDIIFANRP LLMAGKHLGY NFTSLAWSSY 
121 GDHWRNLRRI TSVEMFSTHR LQMLHGIRID EVKSMVKRLN SSAIAEKSVD MKSMFFELML 
181 NVMMRTIAGK RYYGENVEDI EEATRFKGLV QETFRIGGAT NIGDFLPALK LLVRKLEKSL 
241 IVLQENRDEF MQELIKDCRK RMEKEGTVTD SEIEGNKKCL IEVLLTLQEN EPEYYKDEII 
301 RSLMLVLLSA GTDTSVGTME WALSLMLNHP ETLKKAQAEI DEHIGHERLV DESDINNLPY 
361 LRCIINETFR MYPAGPLLVP HESSEETTVG GYRVPGGTML LVNLWAIHND PKLWDEPRKF 
421 KPERFQGLDG VRDGYKMMPF GSGRRSCPGE GLAVRMVALS LGCIIQCFDW QRIGEELVDM 
481 TEGTGLTLPK AQPLVAKCSP RPKMANLLSQ I 



FIG. 84 



NAME D100A-AC3 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 167 

1 CAAAAACAAA ATTCCAATGG 
61 AGCTTTTTAC AT TAT CACAA 
121 ATTTCTTACT TTCCCCTTTA 
181 CTTAGCCAAA ATCTCCGAAC 
241 AGTACTTTTG GTTTCTTCAC 
301 TTTCGCGAAT CGTCCTCTTT 
361 GGCTTGGAGT TCGTACGGAG 
421 GTTTTCGACT CATCGTCTTC 
481 GGTTAAGAGG CTCAATTCCT 
541 TTTTGAGCTG ATGCTCAATG 
601 GAATGTGGAG GACATTGAGG 
661 GATTGGCGGG GCGACGAATA 
721 ATTGGAGAAA AGTTTAATTG 
781 TAAAGATTGC AGAAAAAGAA 
841 GAACAAGAAA TGTTTAATTG 
901 CAAAGATGAA ATCATCAGAA 
961 AGTTGGGACA ATGGAATGGG 
1021 AGCACAAGCT GAAATTGATG 
1081 CAACAACCTA CCTTACCTAC 
1141 ACCACTACTA GTCCCACACG 
1201 CGGAGGAACC ATGTTACTTG 
1261 TGAACCAAGA AAGTTTAAGC 
1321 CAAAATGATG CCTTTTGGTT 
1381 AATGGTTGCA TTGTCATTGG 
1441 AGGATTGGTT GATAAGACTG 
1501 GGCCAAGTGT AGCCCACGAC 
1561 GGTTTCTACC AAACATCCCC 
1621 AATTTTGAAC CATATTATAT 

SEQ. ID. NO. 168 

1 MVNMFTPIIY APLLLAFYII 
61 ERYGSVLLLE FGSRKVLLVS 
121 GDHWRNLRRI TSVEMFSTHR 
181 NVMMRTIAGK RYYGENVEDI 
241 IVLQENRDEF MQELIKDCRK 
301 RSLMLVLLSA GTDTSVGTME 
361 LRCIINETFR MYPAGPLLVP 
421 KPERFEGLEG VRDGYKMMPF 
481 TEGTGLTLPK AQPLVAKCSP 



TTAACATGTT CACTCCAATT 
AACATTTCTT ACGCAAACTC 
TTGGCCATCT TTATCTCTTC 
GTTATGGCTC TGTTCTTCTA 
CATCTGCAGC TGAAGAATGC 
TGATGGCTGG AAAACATCTT 
ATCACTGGAG AAATCTTCGT 
AAATGCTACA TGGAATTCGT 
CTGCCATAGC TGAAAAATCT 
TTATGATGAG GACAATTGCT 
AAGCTACGAG ATTCAAAGGT 
TTGGCGACTT TTTGCCGGCG 
TGTTGCAAGA GAACAGAGAT 
TGGAGAAAGA AGGTACTGTT 
AAGTTTTGTT AACACTACAA 
GCCTTATGCT TGTTCTATTA 
CTTTATCATT AATGTTAAAC 
AACATATAGG ACATGAACGT 
GTTGTATAAT CAACGAGACA 
AGTCGTCAGA GGAAACCACC 
TGAATTTGTG GGCTATTCAC 
CAGAAAGATT TGAAGGACTA 
CTGGACGAAG GAGTTGTCCT 
GATGTATTAT TCAATGCTTT 
AAGGAACTGG ACTTACTTTG 
CTATAATGGC TAATCTTCTT 
AAACTAGAAT ATTATTATTG 
CTCAATGTAT TCCTTTTTAA 



TKHFLRKLRN NPPAPFLTFP 
SPSAAEECLT KNDIIFANRP 
LQMLHGIRID EVKSMVKRLN 
EEATRFKGLV QETFRI GGAT 
RMEKEGTVTD SEIEGNKKCL 
WALSLMLNHP ETLKKAQAEI 
HESSEETTVG GYRVPGGTML 
GSGRRSCPGE GLAIRMVALS 
RPIMANLLSQ I 



ATATACGCTC CTCTCCTTTT 
AGAAATAACC CACCAGCTCC 
AAAAAACCAC TTCAACGTAC 
CTCGAATTCG GTTCACGAAA 
TTAACAAAAA AC GAT AT TAT 
GGATATAATT TTACTTCTTT 
AGGATTACTT CAGTTGAGAT 
ATTGATGAAG TGAAATCTAT 
GTGGATATGA AGTCTATGTT 
GGAAAAAGAT ATTACGGTGA 
TTGGTGCAAG AGACTTTCAG 
TTGAAGTTAT TGGTGAGGAA 
GAGTTTATGC AGGAATTAAT 
ACTGATTCAG AAATTGAAGG 
GAAAATGAAC CGGAATACTA 
TCAGCTGGTA CAGATACTTC 
CACCCTGAAA CTCTGAAGAA 
TTAGTGGACG AGTCGGACAT 
TTCCGAATGT ACCCTGCAGG 
GTAGGAGGCT ACCGTGTACC 
AATGATCCAA AGCTATGGGA 
GAAGGTGTTA GAGACGGTTA 
GGAGAAGGAT TGGCTATTCG 
GATTGGCAAC GACTTGGGGA 
CCTAAAGCTC AACCTTTAGT 
TCTCAGATTT GAACATAATT 
GTTACATATA CAATGTAATC 
AAAAAAAAAA AAAAA 



FIGHLYLFKK PLQRTLAKIS 
LLMAGKHLGY NFTSLAWSSY 
SSAIAEKSVD MKSMFFELML 
NIGDFLPALK LLVRKLEKSL 
IEVLLTLQEN EPEYYKDEII 
DEHIGHERLV DESDINNLPY 
LVNLWAIHND PKLWDEPRKF 
LGCIIQCFDW QRLGEGLVDK 



FIG. 85 



NAME D104A-AE8 (69,1755) 

ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 169 

1 CAACACGCTT ACTATCTCCT AAATCTCCAC TCAAAAACAA AGAAGAGAAA GATTTAAAAC 
61 TAATAATTAT GAAAGAGATG GTGCAAAACA ATATGAGCAC TTCTCTTCTT GAAACTTTAC 
121 AAGCTACGCC CAT GAT ATT C TACTTCATCG TCCCTCTCTT CTGCTTATTC CTTCTCTCCA 
181 AATCTCGCCG TAAACGTTTG CCTCCAGGTC CAACTGGCTG GCCTCTCATT GGTAACATGA 
241 TGATGATGGA CCAGTTAACT CACCGTGGCC TTGCCAAACT AGCCCAAAAA TATGGTGGTG 
301 TTTTTCACCT TAAAATGGGT TATGTTCACA AAATTGTAGT CTCTGGTCCA GACGAAGCTC 
361 GCCAAGTATT ACAGGAACAC GACATCATAT TTTCGAACCG TCCAGCGACC GTAGCCATAA 
421 GTTACCTAAC ATATGACAGG GCAGACATGG CTTTTGCTGA CTATGGACTC TTCTGGCGGC 
481 AGATGAGAAA ACTATGTGTA ATGAAACTCT TCAGCCGCAA ACGAGCTGAG TCATGGGACT 
541 CAGTTCGAGA CGAAGCGGAT TCCATGGTTA GAATTGTAAC AACCAACACA GGCACAGCTG 
601 TTAACTTAGG TGAACTTGTT TTCAGTCTCA CTCGTAATAT TATCTACAGA GCTGCTTTTG 
661 GAACTTGTTC TGAAGATGGA CAAGGCGAGT TCATTAAAAT TAT GCAAGAG TTTTCGAAGC 
721 TATTTGGTGC TTTCAATATA GCTGATTTTA TTCCATGGCT AGGGTGGGTT GGTAAGCAGA 
781 GTCTAAATAT TAGACTTGCT AAGGCTAGAG CGTCGCTTGA TGGGTTCATT GATTCGATTA 
841 TTGATGACCA TATTATTAGA AAGAAAGCTT ATGTTAATGG CAAAAATGAT GGAGGTGATC 
901 GAGAAACTGA TATGGTGGAT GAGCTTTTAG CTTTTTACAG TGAGGAAGCA AAAGTAACTG 
961 AGTCCGAAGA TTTGCAGAAT GCTATCAGAC TTACTAAGGA TAATATCAAA GCTATCATCA 
1021 TGGATGTAAT GTTTGGAGGG ACAGAAACAG TGGCTTCTGC AATAGAATGG GCCATGGCAG 
1081 AGCTTATGAG GAGTCCTGAA GATCTTAAAA AGGTACAACA AGAGCTGGCT AACGTTGTTG 
1141 GACT CAACAG AAAAGTTGAA GAATCTGACT TTGAAAAATT AACATACTTA AGATGTTGTC 
1201 TAAAAGAAAC TCTACGACTT CACCCTCCAA TCCCTCTCCT CCTCCATGAG ACCGCCGAGG 
1261 AATCCACCGT CTCCGGCTAC CATATTCCGG CAAAGTCACA TGTTATTATA AATTCATTTG 
1321 CCATTGGGCG TGACAAAAAT TCATGGGAAG ATCCTGAAAC TTATAAACCA TCTAGGTTTC 
1381 TCAAAGAAGG TGTACCAGAT TTTAAAGGAG GTAATTTTGA GTTTATACCA TTTGGGTCGG 
1441 GTCGGCGGTC TTGCCCCGGT ATGCAACTTG GGCTTTATGC ATTGGAAATG GCTGTGGCCC 
1501 ATCTTCTTCA TTGTTTTACT TGGGAATTGC CAGATGGTAT GAAACCAAGT GAGCTTAAAA 
1561 TGGATGATAT TTTTGGACTC ACTGCTCCAA GAGCTAATCG ACTCGTGGCT GTGCCTACTC 
1621 CACGTTTGTT GTGTCCCCTT TATTAATTGA AGAAAAAAGG TGGGGCTTTT ACTTGCATCA 
1681 AAGAGTGGTG CTTGTGATTT TTCCACCTTT TGGTTAAATA TACGAATTAT TAT GAT AT AC 
1741 GAATTCTTGG GCACA 



SEQ. ID. NO. 170 

1 MKEMVQNNMS TSLLETLQAT PMIFYFIVPL FCLFLLSKSR RKRLPPGPTG WPLIGNMMMM 

61 DQLTHRGLAK LAQKYGGVFH LKMGYVHKIV VSGPDEARQV LQEHDIIFSN RPATVAISYL 

121 TYDRADMAFA DYGLFWRQMR KLCVMKLFSR KRAESWDSVR DEADSMVRIV TTNTGTAVNL 

181 GELVFSLTRN IIYRAAFGTC SEDGQGEFIK IMQEFSKLFG AFNIADFIPW LGWVGKQSLN 

241 IRLAKARASL DGFIDSIIDD HIIRKKAYVN GKNDGGDRET DMVDELLAFY SEEAKVTESE 

301 DLQNAIRLTK DNIKAIIMDV MFGGTETVAS AIEWAMAELM RSPEDLKKVQ QELANWGLN 

361 RKVEESDFEK LTYLRCCLKE TLRLHPPIPL LLHETAEEST VSGYHIPAKS HVIINSFAIG 

421 RDKNSWEDPE TYKPSRFLKE GVPDFKGGNF EFIPFGSGRR SCPGMQLGLY ALEMAVAHLL 

481 HCFTWELPDG MKPSELKMDD IFGLTAPRAN RLVAVPTPRL LCPLY 



FIG. 86 



NAME D105-AD6 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 171 

1 TGTGCTTGTG AGTGTGGGAG AAGGCCTTCA ATATGGAGAT ACCATATTAC AGCTTAAAAA 
61 TTGCAATTTC TTCATTTGCA ATTATCTTTG TACT AAGAT G GGCATGGAAA ATCTTGAATT 
121 ATGTGTGGTT AAAACCAAAA GAATTGGAGA AATACCTCAG ACAGCAGGGT TTCAAAGGAA 
181 ACTCTTACAA ATTCTTGTTT GGGGATATGA AAGAGATGAA GAAAATGGGT GAAGAAGCTA 
241 TGTCTAAGCC AATCAATTTC TCTCATGACA TGATTTGGCC TAGAGTTATG CCATTCATCC 
301 ACAAAACCAT CACCAATTAT GGTAAGAATT GTATTGTGTG GTTTGGGCCA AGACCAGCAG 
361 TCCTGATCAC AGACCCGGAA CTTGTAAAGG AGGTGCTAAC GAAGAATTTC GTCTATCAGA 
421 AGCCGCTTGG CAATCCACTC ACAAAGTTGG CAGCAACTGG AATTGCAGGC TATGAAACAG 
481 ATAAATGGGC TACACATAGA AGGCTTCTCA ATCCTGCTTT TCACCTTGAC AAGTTGAAGC 
541 ATATGCTACC TGCATTCCAA TTTACTGCTA GTGAGATGTT GAGCAAATTG GAGAAAGTTG 
601 TTTCACCAAA CGGAACAGAG ATAGATGTGT GGCCATATTT ACAAACTTTG ACAAGTGATG 
661 CCATTTCAAG AACTGCGTTT GGAAGTAGTT ATGAAGAAGG AAGAAAGATT TTTGACCTTC 
721 AAAAAGAACA ACTTTCACTA ATTCTAGAAG TTTCACGCAC AATATATATT CCAGGATGGA 
781 GGTTTTTGCC AACGAAAAGG AACAAAAGGA TGAAGCAAAT ATTTAATGAA GTACGAGCAC 
841 TGGTATTTGG AATTATTAAG AAAAGGATGA GTATGATTGA AAATGGAGAA GCACCTGATG 
901 ATTTATTGGG AATATTATTG GCATCCAATT TAAAAGAAAT CCAACAACAT GGAAACAACA 
961 AGAAATTTGG TATGAGTATT GATGAGGTGA TTGAAGAGTG TAAACTCTTC TATTTTGCTG 
1021 GGCAAGAGAC TACTTCATCT TTACTTGTAT GGACTATGAT TTTGTTGTGC AAATATCCTA 
1081 ATTGGCAAGA TAAAGCTAGA GAAGAGGTTT TGCAAGTGTT TGGGAGTAGG GAAGTTGACT 
1141 ATGACAAGTT GAATCAGCTA AAAATAGTAA CTATGATCTT AAACGAGGTC TTAAGGTTGT 
1201 ATCCAGCAGG ATATGTGATT AATCGAATGG TAAACAAAGA AACAAAGTTA GGGAATTTGT 
1261 GTTTACCAGC CGGCGTACAG CTCGTGTTAC CAACAATGTT GTTGCAACAT GATACTGAAA 
1321 TATGGGGAGA TGATGCAATG GAGTTCAATC CAGAGAGATT TAGTGATGGA ATATCCAAAG 
1381 CAACAAAAGG AAAACTTGTG TTTTTTCCAT TTAGTTGGGG TCCAAGAATA TGTATTGGGC 
1441 AAAATTTTGC TATGTTAGAG GCTAAAATGG CAATGGCTAT GATTCTGAAA ACCTATGCAT 
1501 TTGAACTCTC TCCATCTTAT GCTCATGCTC CTCATCCACT ACTACTTCAA CCTCAATATG 
1561 GTGCTCAATT AATTTTGTAC AAGTTGTAGA TATGGTCAAT TTGGAACTTG TTATGGAACT 
1621 TTTATCATTG TAATCAACCA TATTGAGGGA ACATGGTTTG AGGTTAAATC CTCGTGTGTG 
1681 TGTC 

SEQ. ID. NO. 172 

1 MEIPYYSLKI AISSFAIIFV LRWAWKILNY VWLKPKELEK YLRQQGFKGN SYKFLFGDMK 
61 EMKKMGEEAM SKPINFSHDM IWPRVMPFIH KTITNYGKNC IVWFGPRPAV LITDPELVKE 
121 VLTKNFVYQK PLGNPLTKLA ATGIAGYETD KWATHRRLLN PAFHLDKLKH MLPAFQFTAS 
181 EMLSKLEKW SPNGTEIDVW PYLQTLTSDA ISRTAFGSSY EEGRKIFDLQ KEQLSLILEV 
241 SRTIYIPGWR FLPTKRNKRM KQIFNEVRAL VFGIIKKRMS MI ENGEAPDD LLGILLASNL 
301 KEIQQHGNNK KFGMSIDEVI EECKLFYFAG QETTSSLLVW TMILLCKYPN WQDKAREEVL 
361 QVFGSREVDY DKLNQLKIVT MILNEVLRLY PAGYVINRMV NKETKLGNLC LPAGVQLVLP 
421 TMLLQHDTEI WGDDAMEFNP ERFSDGISKA TKGKLVFFPF SWGPRICIGQ NFAMLEAKMA 
481 MAMILKTYAF ELSPSYAHAP HPLLLQPQYG AQLILYKL 



FIG. 87 



NAME D109-AH8 (14,1697) 

ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 173 

1 CCAGCACCAA GACATGGAGA ATTCCTGGGT AGTTTTAGCC TTAACAGGCC TTCTTACATT 
61 AGTTTTTCTC TCAAAGTTTC TTCATAGTCC TCGTCGTAAA CAAAATCTTC CACCAGGTCC 
121 AAAACCATGG CCTATTGTTG GCAATATACA TCTTCTTGGT TCCACCCCTC ACAGATCCCT 
181 TCACGAACTT GCAAAAAGAT ACGGAGATTT AATGCTACTA AAGTTCGGTT CGCGCAATGT 
241 CCTTATTTTA TCCTCCCCAG ATATGGCTAG AGAATTCTTG AAAACAAATG ATGCCATTTG 
301 GGCTTCTCGC CCTGAGCTTG CCGCTGGTAA ATATACTGCT TATAATTATT GCGACATGAC 
361 ATGGGCACGT TATGGACCCT TTTGGAGACA AGCAAGGAGG ATCTATCTCA ACGAGATTTT 
421 CAATCCTAAA CGTTTGGATT CATTTGAGTA CATTCGCATA GAGGAAAGGC ATAATTTGAT 
481 TTCACGTCTT TTTGTTCTCT CTGGGAAGCC AATTCTTCTT AGAGACCATT TAACTCGGTA 
541 CACTCTTACA AGTATAAGTA GAACAGTATT GAGTGGAAAA TATTTTAGCG AGTCACCTGG 
601 CCAAAATTCA ATGATAACTT TGAAACAATT GCAGGATATG CTTGATAAGT GGTTTTTGCT 
661 TAATGGTGTG ATCAATATTG GGGACTGGAT ACCTTGGCTT GCTTTCTTGG ATTTGCAGGG 
721 TTATGTCAAG CAAATGAAGG AGTTGCATAG GAACTTCGAC AAATTTCATA ACTTTGTGCT 
7 81 AGATGATCAC AAGGCTAATA GGGGAGAGAA GAACTTTGTG CCAAGAGACA TGGTCGATGT 
841 TTTGCTGCAG CAAGCTGAGG ATCCTAATCT TGAGGTCAAA CTCACCAATG ATTGTGTCAA 
901 GGGTCTAATG CAGGACTTAT TGGCTGGCGG CACGGACACC TCAGCAACAA CCGTTGAATG 
961 GGCTTTTTAT GAACTTCTTA GACAACCTAA GATTATGAAG AAAGCACAAC AAGAGCTAGA 
1021 CCTTGTCATT TCACAGGACA GATGGGTTCA AGAAAAAGAT TACACTCAAC TCCCTTACAT 
1081 TGAGTCAATC ATCAAGGAAA GATTGAGGCT TCACCCAGTA AGCACCATGC TTCCACCGCG 
1141 CATTGCCTTG GAGGATTGTC ATGTAGCAGG CTATGACATA CCTAAAGGTA CAATTTTAAT 
1201 TGTGAACACT TGGAGTATTG GAAGAAATTC ACAGCATTGG GAGTCACCAG AAGAATTCCT 
1261 TCCGGAGAGG TTTGAAGGGA AGAATATTGG TGTCACAGGA CAACATTTTG CGCTCTTGCC 
1321 ATTTGGCGCG GGCCGGAGAA AGTGCCCAGG ATACAGTCTT GGGATTCGTA TAATTAGGGC 
1381 AACTTTAGCT AACTTGTTGC ATGGATTCAA CTGGAGATTG CCTAATGGTA TGAGTCCAGA 
1441 AGACATTAGC ATGGAAGAGA TTTATGGGCT AATTACACAC CCCAAAGTCG CACTTGACGT 
1501 GATGATGGAG CCTCGACTTC CCAACCATCT TTACAAATAG TGGATAATTA AAACCATTAA 
1561 AATCGTTTTG TTATATGCAT GTCTCATATT TGTAGTGGTC AAAATGTTTG TTTTCTATCA 
1621 TGGATGTTCA GTGCGAGGTT GGGAATTTCA AGTCATTAAC GTGTGAAAAT ATTTTAAATT 
1681 TAAAAAAAAA AAAAAAA 

SEQ. ID. NO. 174 

1 MENSWWLAL TGLLTLVFLS KFLHSPRRKQ NLPPGPKPWP IVGNIHLLGS TPHRSLHELA 
61 KRYGDLMLLK FGSRNVLILS SPDMAREFLK TNDAIWASRP ELAAGKYTAY NYCDMTWARY 
121 GPFWRQARRI YLNEIFNPKR LDSFEYIRIE ERHNLISRLF VLSGKPILLR DHLTRYTLTS 
181 ISRTVLSGKY FSESPGQNSM ITLKQLQDML DKWFLLNGVT NIGDWIPWLA FLDLQGYVKQ 
241 MKELHRNFDK FHNFVLDDHK ANRGEKNFVP RDMVDVLLQQ AEDPNLEVKL TNDCVKGLMQ 
301 DLLAGGTDTS ATTVEWAFYE LLRQPKIMKK AQQELDLVIS QDRWVQEKDY TQLPYIESII 
361 KETLRLHPVS TMLPPRIALE DCHVAGYDIP KGTILIVNTW SIGRNSQHWE SPEEFLPERF 
421 EGKNIGVTGQ HFALLPFGAG RRKCPGYSLG IRIIRATLAN LLHGFNWRLP NGMSPEDISM 
481 EEIYGLITHP KVALDVMMEP RLPNHLYK 
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NAME D110-AF12 (166,1631) 

ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 175 

1 ACTGTTCAAA TCACAGTAAC AGCATCTTGT GCTGCCATAA TAATTACTCT AGTGGTGTGT 
61 AT AT GGAGAG TGCTGAATTG GGTTTGGTTC AGACCAAAGA AGCTGGAAAA GCTACTGAGG 
121 AAACAAGGTC TCAAAGGCAA TTCCTACAGG ATTTTGTATG GGGATATGAA GGAGCTTTCT 
181 GGTATGATTA AGGAAGCTAA CTCCAAACCC ATGAATCTTT CT GAT GAT AT TGCCCCAAGA 
241 TTGGTCCCTT TCTTTCTTGA TACCATCAAG AAATATGGGA AAAAATCCTT TGTATGGTTG 
301 GGTCCAAAAC CGCTGGTTTT TGTCATGGAC CCCGAGCTTA TAAAGGAAGT ATTCTCCAAA 
361 AACTATCTGT ATCAAAAGCC TCATTCAAAT CCATTAACCA AGTTACTGGC ACAAGGACTT 
421 GTAAGCCAAG AGGAAGACAA ATGGGCCAAA CATAGAAAAA TCGTCACTCC TGCCTTCCAC 
481 CTGGAGAAGC TAAAGCATAT GCTTCCAGCT TTTTGTTTGA GCTGTACTGA GATGCTGAGC 
541 AAATGGGAAG ACATTGTTGC AGTTGAGGGC TCACATGAGA TAGATATATG GCCTGGCCTT 
601 CAACAATTAA CTAGTGATGT GATCTCTCGG ACAGCCTTTG GCAGTAGCTA TGAAGCAGGT 
661 AGAAGGATAT TTGAACTTCA AAAGGAACAA GCTCAATTTC TTATGGAAGC TATACGCTCC 
721 GTTTATATTC CAGGCTGGAG GTTTTTGCCA ACAAAGAGGA ACAGAAGAAT GAAGGAAATT 
781 GAAAAGGATG TTCAAGCCTT AGTTAGAGGT ATTATTGATA AAAGAGTAAA GTCAATGAAA 
841 GCAGGAGAGG TGAATAATGA GGATCTGCTT GGTATATTGC TGGAATCTAA TTTTAAAGAA 
901 ATTGAACAGC ATGGAAACAA GGATTTTGGA ATGAGCATTG AAGAAGTCAT TCAAGAATGC 
961 AAGTTATTCT ATTTTGCTGG CCAAGAAACT ACATCAGTGT TGCTTGTATG GACTCTAATA 
1021 TTGCTGAGCA GGCATCAGGA TTGGCAAGCA CTGGCCAGAG AAGAGGTGTT GCAAGTCTTT 
1081 GGGAATCAGA AACCAGATTT TGATGGATTA AATCGTCTAA AAATTGTTAC AATGATCTTG 
1141 TACGAGTCTT TAAGGCTCTA TCCCCCAGTA GTGACACTTA CCCGAAGGCC TAAGGAAGAC 
1201 ACTGTATTAG GAGATGTATC TCTACCAGCA GGTGTGTTAA TCTCCTTACC AGTGATCTTA 
1261 TTGCATCACG ACGAAGAGAT ATGGGGTAAA GATGCAAAGA AGTTCAAGCC AGAGAGATTC 
1321 AGAGATGGAG TCTCAAGTGC AACAAAGGGT CAAGTCACTT TTTTCCCATT TACTTGGGGT 
1381 CCCAGAATAT GCATTGGACA AAATTTTGCC ATGTTAGAAG CAAAGACTAC TTTGGCTATG 
1441 AT C CT ACAAC GCTTCTCCTT TGAACTGTCT CCATCTTATG CACATGCTCC TCAGTCCATA 
1501 ATAACTTTGC AACCCCAGTA TGGTGCTCCA CTTATTTTGC ATAAAATATA GTT TAT TACT 
1561 TGTAAGTAGT GTCTCGTTTT ATGTTAAGCA TGAGTCCAAA ATGTTAAGGC TTGTAGAACT 
1621 GCAAAATGGG A 

SEQ. ID. NO. 176 

1 MKELSGMIKE ANSKPMNLSD DIAPRLVPFF LDTIKKYGKK SFVWLGPKPL VFVMDPELIK 
61 EVFSKNYLYQ KPHSNPLTKL LAQGLVSQEE DKWAKHRKIV TPAFHLEKLK HMLPAFCLSC 
121 TEMLSKWEDI VAVEGSHEID IWPGLQQLTS DVISRTAFGS SYEAGRRIFE LQKEQAQFLM 
181 EAIRSVYIPG WRFLPTKRNR RMKEIEKDVQ ALVRGIIDKR VKSMKAGEVN NEDLLGILLE 
241 SNFKEIEQHG NKDFGMSIEE VIQECKLFYF AGQETTSVLL VWTLILLSRH QDWQALAREE 
301 VLQVFGNQKP DFDGLNRLKI VTMI LYE SLR LYPPWTLTR RPKEDTVLGD VSLPAGVLIS 
361 LPVILLHHDE EIWGKDAKKF KPERFRDGVS SATKGQVTFF PFTWGPRICI GQNFAMLEAK 
421 TTLAMILQRF SFELSPSYAH APQSIITLQP QYGAPLILHK I 
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NAME D112-AA5 
ORGANISM NICOTIANA TABACUM 

SEQ. ID. NO 1^ GAAAATGCAA TTCTTCAGCT TGGTTTCCAT TTTCCTATTC CTATCTTTCC 

61 TATTTTTGTT GAGGAAATGG AAGAACTCCA ATAGCCAAAG CAAAAAATTG CCACCAGGTC 
121 Stggaaaat ACCAATACTA GGAAGTATGC TTCATATGAT TGGTGGAGAA ccgcaccatg 
181 TCCTTAGAGA TTTAGCCAAA AAAGATGGAC CACTTATGCA CCTTCAGTTA GGTGAAATTT 

2" ctgcIgttgt GGTTACTTCT agggacatgg caaaagaagt gctaaaaact catgacgtcg 
301 tttttgcatc taggcctaaa attgtagcca tggacattat ctgttataac cagtccgaca 

361 TTGCCTTTAG CCCTTATGGC GACCACTGGA GACAAATGCG TAAAATTTGT GTCATGGAAC 

421 "cSgc a^ga^tgtt cggtctttca gctccatcag acgtgatgaa gtcgttcgtc 

481 TCATTGACTC TATCCGGTCA GATTCTTCTT CAGGTGAGCT AGTTAATTTT ACGCAGAGGA 
541 TCATTTGGTT TGCAAGCTCC ATGACGTGTA GATCAGCATT TGGGCAAGTA CTCAAGGGGC 
601 AAGACATATT TGCCAAAAAG ATCAGAGAAG TAATAGGATT AGCAGAAGGC TTTGATGTGG 
Hi ™ScTT CCCTACATAC AAGTTTCTTC ATGTTCTCAG TGGGATGAAG CGTAAACTTT 
721 S^CCCA CCTTAAGGTA GACGCCATTG TTGAGGATGT CATCAACGAG CACAAGAAAA 
781 ATCTTGCAGC TGGCAAGAGT AATGGCGCAT TAGGAGGCGA AGATCTAATT GATGTCCTAC 
841 TGAGACTTAT GAATGACACA AGTCTTCAAT TTCCCATCAC CAACGACAAT ATCAAAGCTG 
901 TTGTTGTTGA CATGTTTGCT GCCGGAACAG AAACTTCATC AACAACAACT GTATGGGCCA 

961 ?ggctgIaat GATGAAGAAT ccaagtgtat TCGCCAAAGC tcaagcagaa gtgcgagaag 
1021 cctttaggga caaagtatct tttgatgaaa atgatgtgga ggagctgaaa tacttaaagt 
10I1 tag£cat?aa aSaactttg agacttcatc caccgtctcc acttttggtc ccaagagaat 
1141 gcagggaaga tacggatata aacggctaca ctattcctgc aaagaccaaa gttatggtta 

1201 ATGTTTGGGC ATTGGGAAGA GATCCAAAAT ATTGGGATGA CGCGGAAAGC TTTAAGCCAG 
llll AGAGATTTGA ScAATGTTCT GTAGATATTT TTGGTAATAA TTTTGAGTTT CTTCCCTTTG 
1221 GCGGGGGAC G GAGAATTTGT CCTGGAATGT CATTTGGTTT AGCTAATCTT TACTTACCAT 
1381 TGGCTCAATT ACTCTATCAC TTTGACTGGA AACTCCCAAC CGGAATCAAG CCAAGAGACT 
1441 TGGACTTGAC CGAATTATCG GGAATAACTA TTGCTAGAAA GGGTGACCTT TACTTAAATG 
1501 CTACTCCTTA TCAACCTTCT CGAGAGTAAT TTACTATTGG CATAAACATT TTAAATTTCC 
1561 TTCATCAACC TC 

SEQ. ID. NO- 178 LFLSFLFLLR KWKNSNSQSK KLPPGPWKIP ILGSMLHMIG GEPHHVLRDL 

61 AKKDGPLMHL QLGEISAVW TSRDMAKEVL KTHDWFASR PKIVAMDIIC ™QSDIAFSP 
121 YGDHWRQMRK ICVMELLNAK NVRSFSSIRR DEWRLIDSI RSDSSSGELV NFTQRIIWFA 
181 SSMTCRSAFG QVLKGQDIFA KKIREVIGLA EGFDWDIFP TYKFLHVLSG MKRKLLNAHL 
241 KVDAIVEDVI NEHKKNIAAG KSNGALGGED LIDVLLRLMN DTSLQFPITN DNIKAVWDM 
301 FAAGTETSST TTWAMAEMM KNPSVFAKAQ AEVREAFRDK VSFDENDVEE LKYLKLVIKE 
361 TLRLHPPSPL LVPRECREDT DINGYTI PAK TKVMVNWAL GRDPKYWDDA ESFKPERFEQ 
421 CSVDIFGNNF EFLPFGGGRR ICPGMSFGLA NLYLPLAQLL YHFDWKLPTG IKPRDLDLTE 
481 LSGITIARKG DLYLNATPYQ PSRE 
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NAME D120-AH4 

ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 179 

1 ATAATGCTTT CTCCCATAGA AGCCATTGTA GGACTAGTAA CCTTCACATT TCTCTTCTTC 
61 TTCCTATGGA CAAAAAAATC TCAAAAACCT TCAAAACCCT TACCACCGAA AATCCCCGGA 
121 GGATGGCCGG TAATCGGCCA TCTTTTCCAC TTCAATGACG ACGGCGACGA CCGTCCATTA 
181 GCTCGAAAAC TCGGAGACTT AGCTGACAAA TACGGCCCCG TTTTCACTTT TCGGCTAGGC 
241 CTTCCCCTTG TCTTAGTTGT AAGCAGTTAC GAAGCTGTAA AAGACTGTTT CTCTACAAAT 
301 GACGCCATTT TTTCCAATCG TCCAGCTTTT CTTTACGGCG ATTACCTTGG CTACAATAAT 
361 GCCATGCTAT TTTTGGCCAA TTACGGACCT TACTGGCGAA AAAATCGAAA ATTAGTTATT 
421 CAGGAAGTTC TCTCCGCTAG TCGTCTCGAA AAATTCAAAC ACGTGAGATT TGCAAGAATT 
481 CAAGCGAGCA TTAAGAATTT ATATACTCGA ATTGATGGAA ATTCGAGTAC GATAAATTTA 
541 ACTGATTGGT TAGAAGAATT GAATTTTGGT CTGATCGTGA AGATGATCGC TGGAAAAAAT 
601 TATGAATCCG GTAAAGGAGA TGAACAAGTG GAGAGATTTA AGAAAGCGTT TAAGGATTTT 
661 ATGATTTTAT CAATGGAGTT TGTGTTATGG GATGCATTTC CAATTCCATT ATTTAAATGG 
721 GTGGATTTTC AAGGGCATGT TAAGGCTATG AAAAGGACTT TTAAAGATAT AGATTCTGTT 
781 TTTCAGAATT GGTTAGGGGA ACATATTAAT AAAAGAGAAA AAAT GGAGGT TAATGCAGAA 
841 GGGAATGAAC AAGATTTCAT TGATGTGGTG CTTTCAAAAA TGAGTAATGA ATATCTTGGT 
901 GAAGGTTACT CTCGTGATAC TGTCATTAAA GCAACGGTGT TTAGTTTGGT CTTGGATGCA 
961 GCAGACACAG TTGCTCTTCA CATAAATTGG GGAATGGCAT TATTGATAAA CAATCAAAAG 
1021 GCCTTGACGA AAGCACAAGA AGAGATAGAC ACAAAAGTTG GTAAGGACAG ATGGGTAGAA 
1081 GAGAGTGATA TTAAGGATTT GGTATACCTC CAAGCTATTG TTAAAGAAGT GTTACGATTA 
1141 TATCCACCAG GACCTTTGTT AGTACCACAC GAAAATGTAG AAGATTGTGT TGTTAGTGGA 
1201 TATCACATTC CTAAAGGGAC AAGATTATTC GCAAACGTCA TGAAACTGCT ACGTGATCCT 
1261 AAACTCTGGC CTGATCCTGA TACTTTCGAT CCAGAGAGAT TCATTGCTAC TGATATTGAC 
1321 TTTCGTGGTC AGTACTATAA GTATATCCCG TTTGGTTCTG GAAGACGATC TTGTCCAGGG 
1381 ATGACTTATG CATTGCAAGT GGAACACTTA ACAATGGCAC ATTTGATCCA AGGTTTCAAT 
1441 TACAGAACTC CAAATGACGA GCCCTTGGAT ATGAAGGAAG GTGCAGGCAT AACTATACGT 
1501 AAGGTAAATC CTGTGGAACT GATAATAGCG CCTCGCCTGG CACCTGAGCT TTATTAAAAC 
1561 CTAAGATCTT TCATCTTGGT TGATCATTGT ATAATACTCC TAAATGGATA TTCATTTACC 
1621 TTTTATCAAT TAA 

SEQ. ID. NO. 180 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK FKHVRFARIQ ASIKNLYTRI DGNSSTINLT 
181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 
241 DFQGHVKAMK RTFKDIDSVF QNWLGEHINK REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
301 GYSRDTVIKA TVFSLVLDAA DTVALHINWG MALLINNQKA LTKAQEEIDT KVGKDRWVEE 
361 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE NVEDCWSGY HIPKGTRLFA NVMKLLRDPK 
421 LWPDPDTFDP ERFIATDIDF RGQYYKYIPF GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP RLAPELY 
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NAME D121-AA8 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 181 

1 AATCCATAAT GCTTTCTCCC ATAGAAGCCA TTGTAGGACT AGTAACCTTC ACATTTCTCT 
61 TCTTCTTCCT ATGGACAAAA AAATCTCAAA AACCTTCAAA ACCCTTACCA CCGAAAATCC 
121 CCGGAGGATG GCCGGTAATC GGCCATCTTT TCCACTTCAA TGACGACGGC GACGACCGTC 
181 CATTAGCTCG AAAACTCGGA GACTTAGCTG ACAAATACGG CCCCGTTTTC ACTTTTCGGC 
241 TAGGCCTTCC CCTTGTCTTA GTTGTAAGCA GTTACGAAGC TGTAAAAGAC TGTTTCTCTA 
301 CAAATGACGC CATTTTTTCC AATCGTCCAG CTTTTCTTTA CGGCGATTAC CTTGGCTACA 
361 ATAATGCCAT GCTATTTTTG GCCAATTACG GACCTTACTG GCGAAAAAAT CGAAAATTAG 
421 TTATTCAGGA AGTTCTCTCC GCTAGTCGTC TCGAAAAATT CAAACACGTG AGATTTGCAA 
481 GAATTCAAGC GAGCATTAAG AATTTATATA CTCGAATTGA TGGAAATTCG AGTACGATAA 
541 ATTTAACTGA TTGGTTAGAA GAATTGAATT TTGGTCTGAT CGTGAAGATG ATCGCTGGAA 
601 AAAATTATGA ATCCGGTAAA GGAGATGAAC AAGTGGAGAG ATTTAAGAAA GCGTTTAAGG 
661 ATTTTATGAT TTTATCAATG GAGTTTGTGT TATGGGATGC ATTTCCAATT CCATTATTTA 
721 AATGGGTGGA TTTTCAAGGG CATGTTAAGG CTATGAAAAG GACTTTTAAA GATATAGATT 
781 CTGTTTTTCA GAATTGGTTA GAGGAACATA TTAATAAAAG AGAAAAAATG GAGGTTAATG 
841 CAGAAGGGAA TGAACAAGAT TTCATTGATG TGGTGCTTTC AAAAATGAGT AATGAATATC 
901 TTGGTGAAGG TTACTCTCGT GATACTGTCA TTAAAGCAAC GGTGTTTAGT TTGGTCTTGG 
961 ATGCAGCAGA CACAGTTGCT CTTCACATAA ATTGGGGAAT GGCATTATTG ATAAACAATC 
1021 AAAAGGCCTT GACGAAAGCA CAAGAAGAGA TAGACACAAA AGTTGGTAAG GACAGATGGG 
1081 TAGAAGAGAG TGATATTAAG GATTTGGTAT ACCTCCAAGC TATTGTTAAA GAAGT GTTAC 
1141 GATTATATCC ACCAGGACCT TTGTTAGTAC CACACGAAAA TGTAGAAGAT TGTGTTGTTA 
1201 GTGGATATCA CATTCCTAAA GGGACAAGAT TATTCGCAAA CGTCATGAAA CTGCAACGTG 
1261 ATCCTAAACT CTGGTCTGAT CCTGATACTT TCGATCCAGA GAGATTCATT GCTACTGATA 
1321 TTGACTTTCG TGGTCAGTAC TATAAGTATA TCCCGTTTGG TTCTGGAAGA CGATCTTGTC 
1381 CAGGGATGAC TTATGCATTG CAAGTGGAAC ACTTAACAAT GGCACATTTG ATCCAAGGTT 
1441 TCAATTACAG AACTCCAAAT GACGAGCCCT TGGATATGAA GGAAGGTGCA GGCATAACTA 
1501 TACGTAAGGT AAATCCTGTG GAACTGATAA TAGCGCCTCG CCTGGCACCT GAGCTTTATT 
1561 AAAACCTAAG ATCATCTTGC TTGAT 

SEQ. ID. NO. 182 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK FKHVRFARIQ ASIKNLYTRI DGNSSTINLT 
181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 
241 DFQGHVKAMK RTFKDIDSVF QNWLEEHINK REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
301 GYSRDTVIKA TVFSLVLDAA DTVALHINWG MALLINNQKA LTKAQEEIDT KVGKDRWVEE 
361 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE NVEDCWSGY HIPKGTRLFA NVMKLQRDPK 
421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP RLAPELY 



FIG. 92 



NAME D122-AF10 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 183 

1 CTAAAACTCC ATAATGGTTT CTCCCGTAGA AGCCATTGTA GGACTAGTAA CCCTTACACT 
61 TCTCTTCTAC TTCCTATGGC CCAAAAAATT TCAAATACCT TCAAAACCAT TACCACCGAA 
121 AATTCCCGGA GGGTGGCCGG TAATCGGCCA TCTTTTCTAC TTCGATGATG ACGGCGACGA 
181 CCGTCCATTA GCTCGAAAAC TCGGAGACTT AGCTGACAAA TACGGCCCGG TTTTCACTTT 
241 CCGGCTAGGC CTTCCGCTTG TGTTAATTGT AAGCAGTTAC GAAGCTGTAA AAGACTGCTT 
301 CTCTACAAAT GACGCCATTT TCTCCAATCG TCCAGCTTTT CTTTACGGTG AATACCTTGG 
361 CTACAATAAT GCCATGCTAT TTTTGACAAA ATACGGACCT TATTGGCGAA AAAATAGAAA 
421 ATTAGTCATT CAGGAAGTTC TCTCTGCTAG TCGTCTCGAA AAATTGAAGC ACGTGAGATT 
481 TGGTAAAATT CAAACGAGCA TTAAGAGTTT ATACACTCGA ATTGATGGAA ATTCGAGTAC 
541 GATAAATCTA ACTGATTGGT TAGAAGAATT GAATTTTGGT CTGATCGTGA AAATGATCGC 
601 TGGGAAAAAT TATGAATCCG GTAAAGGAGA TGAACAAGTG GAGAGATTTA GGAAAGCGTA 
661 TAAGGATTTT ATAATTTTAT CAATGGAGTT TGTGTTATGG GATGCTTTTC CAATTCCATT 
721 GTTCAAATGG GTGGATTTTC AAGGCTATGT TAAGGCCATG AAAAGGACAT TTAAGGATAT 
781 AGATTCTGTT TTTCAGAATT GGTTAGAGGA ACATGTCAAG AAAAGAGAAA AAATGGAGGT 
841 TAATGCACAA GGGAAT GAAC AAGATTTCAT TGATGTGGTG CTTTCAAAAA TGAGTAATGA 
901 ATATCTTGAT GAAGGTTACT CTCGTGATAC TGTCATAAAA GCAACAGTGT TTAGTTTGGT 
961 CTTGGATGCT GCGGACACAG TTGCTCTTCA CATGAATTGG GGAATGGCAT TACT GAT AAA 
1021 CAATCAACAT GCCTTGAAGA AAGCACAAGA AGAGATCGAT AAGAAAGTTG GTAAGGAAAG 
1081 ATGGGTAGAA GAGAGT GATA TTAAGGATTT GGTCTACCTC CAAGCTATTG TTAAAGAAGT 
1141 GTTACGATTA TATCCACCAG GACCTTTATT AGTACCTCAT GAAAATGTAG AGGATTGTGT 
1201 TGTTAGTGGA TATCACATTC CTAAAGGGAC TAGACTATTC GCGAACGTTA TGAAATTGCA 
1261 GCGCGATCCT AAACTCTGGT CAAATCCTGA TAAGTTTGAT CCAGAGAGAT TCTTCGCTGA 
1321 TGATATTGAC TACCGTGGTC AGCACTATGA GTTTATCCCA TTTGGTTCTG GAAGACGATC 
1381 TTGTCCGGGG ATGACTTATG CATTACAAGT GGAACACCTA ACAATAGCAC ATTTGATCCA 
1441 GGGTTTCAAT TACAAAACTC CAAATGACGA GCCCTTGGAT ATGAAGGAAG GTGCAGGATT 
1501 AACTATACGT AAAGTAAATC CTGTAGAAGT GACAATTACG GCTCGCCTGG CACCTGAGCT 
1561 TTATTAAAAC CTTAGATGTT TTATCTTGAT TGTACTAATA TATATATGCA GAAAAAATTG 

SEQ. ID. NO. 184 

1 MVSPVEAIVG LVTLTLLFYF LWPKKFQIPS KPLPPKIPGG WPVIGHLFYF DDDGDDRPLA 
61 RKLGDLADKY GPVFTFRLGL PLVLIVSSYE AVKDCFSTND AIFSNRPAFL YGEYLGYNNA 
121 MLFLTKYGPY WRKNRKLVIQ EVLSASRLEK LKHVRFGKIQ TSIKSLYTRI DGNSSTINLT 
181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE RFRKAYKDFI ILSMEFVLWD AFPIPLFKWV 
241 DFQGYVKAMK RTFKDIDSVF QNWLEEHVKK REKMEVNAQG NEQDFIDWL SKMSNEYLDE 
301 GYSRDTVIKA TVFSLVLDAA DTVALHMNWG MALLINNQHA LKKAQEEIDK KVGKERWVEE 
361 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE NVEDCWSGY HIPKGTRLFA NVMKLQRDPK 
421 LWSNPDKFDP ERFFADDIDY RGQHYEFIPF GSGRRSCPGM TYALQVEHLT IAHLIQGFNY 
481 KTPNDEPLDM KEGAGLTIRK VNPVEVTITA RLAPELY 
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NAME D128-AB7 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 185 

1 CGAGGCTCCC CACCAAAAAA TCATTTCTCT CGTCTAAAAT GGATCTTCTC TTACTAGAGA 
61 AGACCTTAAT TGGTCTTTTC TTTGCCATTT TAATCGCTTT AATTGTCTCT AAACTTCGTT 
121 CAAAGCGTTT TAAGCTTCCT CCAGGACCAA TTCCAGTACC AGTTTTTGGT AATTGGCTTC 
181 AAGTTGGTGA TGATTTAAAC CACAGAAATC TTACTGATTA TGCCAAAAAA TTTGGCGATC 
241 TTTTCTTGTT AAGAATGGGT CAACGTAACT TAGTTGTTGT GTCATCTCCT GAAT TAGCTA 
301 AAGAAGTTTT ACACACACAA GGTGTTGAAT TTGGTTCAAG AACAAGAAAT GTTGTGTTTG 
361 ATATTTTTAC TGGAAAAGGT CAAGATATGG TTTTTACTGT ATATGGTGAA CATT GGAGAA 
421 AAATGAGGAG AATTATGACT GTACCATTTT TTACTAATAA AGTTGTGCAA CAGTATAGAG 
481 GGGGGTGGGA GTTTGAGGTG GCAAGTGTAA TTGAGGATGT GAAAAAAAAT CCTGAATCTG 
541 CTACTAATGG GATCGTATTA AGGAGGAGAT TACAATTAAT GATGTATAAT AATATGTTTA 
601 GGATTATGTT T GAT AGGAGA TTTGAGAGTG AAGATGATCC TTTGTTTGTT AAGCTTAAGG 
661 CTTTGAATGG TGAAAGGAGT AGATTGGCTC AAAGTTTTGA GTATAATTAT GGTGATTTTA 
721 TTCCAATTTT GAGGCCTTTT TTGAGAGGTT ATTTGAAGAT CTGTAAAGAA GTTAAGGAGA 
781 AGAGGCTGCA GCTTTTCAAA GATTACTTTG TTGATGAAAG AAAGAAGCTT TCAAATACCA 
841 AGAGCTCGGA CAGCAATGCC CTAAAATGTG CGATTGATCA CATTCTTGAG GCTCAACAGA 
901 AGGGAGAGAT CAATGAGGAC AACGTTCTTT ACATTGTTGA AAACATCAAT GTTGCTGCAA 
961 TTGAAACAAC ATTATGGTCA ATTGAGTGGG GTATCGCCGA GCTAGTCAAC CACCCTCACA 
1021 TCCAAAAGAA ACTGCGCGAC GAGATTGACA CAGTTCTTGG ACCAGGAGTG CAAGTGACTG 
1081 AACCAGACAC CCACAAGCTT CCATACCTTC AGGCTGTGAT CAAGGAGGCA CTTCGTCTCC 
1141 GTATGGCAAT TCCTCTATTA GTCCCACACA TGAACCTTCA CGACGCAAAG CTTGGCGGGT 
1201 TTGATATTCC AGCAGAGAGC AAAATCTTGG TTAACGCTTG GTGGTTAGCT AACAACCCGG 
1261 CTCATTGGAA GAAACCCGAA GAGTTCAGAC CCGAGAGGTT CTTTGAAGAG GAGAAGCATG 
1321 TTGAGGCCAA TGGCAATGAC TTCAGATATC TTCCGTTTGG CGTTGGTAGG AGGAGCTGCC 
1381 CTGGAATTAT ACTTGCATTG CCAATTCTTG GCATCACTTT GGGACGTTTG GTTCAGAACT 
1441 TTGAGCTGTT GCCTCCTCCA GGCCAGTCGA AGCTCGACAC CACAGAGAAA GGTGGACAGT 
1501 TCAGTCTCCA CATTTTGAAG CATTCCACCA TTGTGTTGAA ACCAAGGTCT TTCTGAACTT 
1561 TGTGATCTTA TTAATTAAGG GGTTCTGAAG AAATTTGATA GTGTTGGATA TTAAGGGCGA 
1621 ATT 

SEQ. ID. NO. 186 

1 MDLLLLEKTL IGLFFAILIA LIVSKLRSKR FKLPPGPIPV PVFGNWLQVG DDLNHRNLTD 

61 YAKKFGDLFL LRMGQRNLW VSSPELAKEV LHTQGVEFGS RTRNWFDIF TGKGQDMVFT 

121 VYGEHWRKMR RIMTVPFFTN KWQQYRGGW EFEVASVIED VKKNPESATN GIVLRRRLQL 

181 MMYNNMFRIM FDRRFESEDD PLFVKLKALN GERSRLAQSF EYNYGDFIPI LRPFLRGYLK 

241 ICKEVKEKRL QLFKDYFVDE RKKLSNTKSS DSNALKCAID HILEAQQKGE INEDNVLYIV 

301 ENINVAAIET TLWSIEWGIA ELVNHPHIQK KLRDEIDTVL GPGVQVTEPD THKLPYLQAV 
361 IKEALRLRMA IPLLVPHMNL HDAKLGGFDI PAESKILVNA WWLANNPAHW KKPEEFRPER 

421 FFEEEKHVEA NGNDFRYLPF GVGRRSCPGI ILALPILGIT LGRLVQNFEL LPPPGQSKLD 
481 TTEKGGQFSL HILKHSTIVL KPRSF 
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NAME D129-AD10 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 187 

1 CAACACGCTT ACTATCTCCT AAATCTCCAC TCAAAAACAA AGAAGAGAAA GATTTAAAAC 
61 TAATAATTAT GAAAGAGATG GTGCAAAACA ATATGAGCAC TTCTCTTCTT GAAACTTTAC 
121 AAGCTACGCC CAT GAT ATT C TACT T CAT CG TCCCTCTCTT CTGCTTATTC CTTCTCTCCA 
181 AATCTCGCCG TAAACGTTTG CCTCCAGGTC CAACTGGCTG GCCTCTCATT GGTAACATGA 
241 TGATGATGGA CCAGTTAACT CACCGTGGCC TTGCCAAACT AGCCCAAAAA TATGGTGGTG 
301 TTTTTCACCT TAAAATGGGT TATGTTCACA AAATTGTAGT CTCTGGTCCA GACGAAGCTC 
361 GCCAAGTATT ACAGGAACAC GACATCATAT TTTCGAACCG TCCAGCGACC GTAGCCATAA 
421 GTTACCTAAC ATATGACAGG GCAGACATGG CTTTTGCTGA CTATGGACTC TTCTGGCGGC 
4 81 AGATGAGAAA ACTATGTGTA ATGAAACTCT TCAGCCGCAA ACGAGCTGAG TCATGGGACT 
541 CAGTTCGAGA CGAAGCGGAT TCCATGGTTA GAATTGTAAC AACCAACACA GGCACAGCTG 
601 TTAACTTAGG TGAACTTGTT TTCAGTCTCA CTCGTAATAT TATCTACAGA GCTGCTTTTG 
661 GAACTTGTTC TGAAGATGGA CAAGGCGAGT TCATTGAAAT TATGCAAGAG TTTTCGAAGC 
721 TATTTGGCGC TTT CAATATA GCTGATTTTA TTCCATGGCT AGGGTGGGTT GGTAAGCAGA 
781 GTCTAAATAT TAGACTTGCT AAGGCTAGAG CGTCGCTTGA TGGGTTCATT GATTCGATTA 
841 TTGATGACCA TATTATTAGA AAGAAAGCTT ATGTTAATGG CAAAAATGAT GGAGGTGATC 
901 GAGAAACTGA TATGGTGGAT GAGCTTTTAG CTTTTTACAG TGAGGAAGCA AAAGTAACTG 
961 AGTCCGAAGA TTTGCAGAAT GCTATCAGAC TTACTAAGGA TAGTATCAAA GCTATCATCA 
1021 TGGATGTAAT GTTTGGAGGG ACAGAAACAG TGGCTTCTGC AATAGAATGG GCCATGGCAG 
1081 AGCTTATGAG GAGTCCTGAA GATCTTAAAA AAGTACAACA AGGGCTGGCT AACGTTGTTG 
1141 GACTCAACAG AAAAGTT GAA GAATCTGACT TTGAAAAATT AACATACTTA AGATGTTGTC 
1201 TAAAAGAAAC TCTACGACTT CACCCTCCAA TCCCTCTCCT CCTCCATGAG ACCGCCGAGG 
1261 AATCCACCGT CTCCGGCTAC CATATTCCGG CAAAGTCACA TGTTATTATA AATTCATTTG 
1321 CCATTGGGCG TGACAAAAAT TCATGGGAAG ATCCTGAAAC TTATAAACCA TCTAGGTTTC 
1381 TCAAAGAAGG TGTACCAGAT TTTAAAGGAG GTAATTTTGA GTTTATACCA TTTGGGTCGG 
1441 GTCGGCGGTC TTGCCCCGGT ATGCAACTTG GGCTTTATGC ATTGGAAATG GCTGTGGCCC 
1501 ATCTTCTTCA TTGTTTTACT TGGGAATTGC CAGATGGTAT GAAACCAAGT GAGCTTAAAA 
1561 TGGATGATAT TTTTGGACTC ACTGCTCCAA GAGCTAATCG ACTCGTGGCT GTGCCTACTC 
1621 CACGCTTGTT GTGTCCCCTT TATTAATTGA AGAAAAAAGG TGGGGCT 

SEQ. ID. NO. 188 

1 MKEMVQNNMS TSLLETLQAT PMIFYFIVPL FCLFLLSKSR RKRLPPGPTG WPLIGNMMMM 
61 DQLTHRGLAK LAQKYGGVFH LKMGYVHKIV VSGPDEARQV LQEHDIIFSN RPATVAI S YL 
121 TYDRADMAFA DYGLFWRQMR KLCVMKLFSR KRAESWDSVR DEADSMVRIV TTNTGTAVNL 
181 GELVFSLTRN IIYRAAFGTC SEDGQGEFIE IMQEFSKLFG AFNIADFIPW LGWVGKQSLN 
241 IRLAKARASL DGFIDSIIDD HIIRKKAYVN GKNDGGDRET DMVDELLAFY SEEAKVTESE 
301 DLQNAI RLTK DSIKAIIMDV MFGGTETVAS AIEWAMAELM RSPEDLKKVQ QGLANWGLN 
361 RKVEESDFEK LTYLRCCLKE TLRLHPPIPL LLHETAEEST VSGYHIPAKS HVIINSFAIG 
421 RDKNSWEDPE TYKPSRFLKE GVPDFKGGNF EFIPFGSGRR SCPGMQLGLY ALEMAVAHLL 
481 HCFTWELPDG MKPSELKMDD IFGLTAPRAN RLVAVPTPRL LCPLY 
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NAME D135-AE1 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 189 

1 GGGGGATAAG AATATGGAGA TACCATATTA 
61 AATTATCTTT GTACTAAGAT GGGCATGGAA 
121 AGAATTGGAG AAATGCATCA GACAGCAGGG 
181 TGGGGATATG AAAGAGATAA AGAAAATGGG 
241 CTCTCATGAC ATGATTTGGC CTAGAGTCAT 
301 TGGTAAGAAT TGTTTTGTGT GGTTTGGGCC 
361 ACTTGTAAAG GAGGTGCTAA CGAAGAATTT 
421 CACAAAATTG GCAGCAACTG GAATTGCAGG 
481 AAGGCTTCTC AATCCTGCTT TTCACCTTGA 
541 ATTTACTGCT TGTGAGATGT TGAGCAAATT 
601 GATAGATGTG TGGCCATATC TACAAACTTT 
661 TGGCAGTAGT TATGAAGAAG GAAGAAAGCT 
721 AATTCTAGAA GTGTCCCGCA CAATATACAT 
781 GAACAAAAGG AT GAAG C AAA TATTTAATGA 
841 GAAAAGATTG AGTATGATTG AAAATGGAGA 
901 GGCATCCAAT TTAAAAGAAA TCCAACAACA 
961 TGATGAGGTG ATTGAAGAGT GTAAACTCTT 
1021 TTTACTTGTA TGGACTATGA TTTTGTTGTG 
1081 AGAAGAGGTT TTGCAAGTGT TTGGAAGTAG 
1141 AAAAATAGTA ACT AT GAT CT TAAACGAGGT 
1201 TAATCGAATG GTAACCAAAG AAACAAAGTT 
1261 ACTCTTGTTA CCAACAATTT TGTTGCAACA 
1321 GGAGTTCAAT CCAGAGAGAT TTAGTGATGG 
1381 GTTCTTTCCA TTTAGTTGGG GTCCAAGAAT 
1441 GGCCAAGATG GCAATGGCTA TGATTCTGAA 
1501 TGCTCATGCT CCTCATCCAC TACTACTTCA 
1561 CAAGTTGTAG AAATGGTCAA TTTGGAACTT 

SEQ. ID. NO. 190 

1 MEIPYYSLKL TIFSFAIIFV LRWAWKILNY 
61 EIKKMGEEAM SKPINFSHDM IWPRVMPFIH 
121 VLTKNFVYQK PPGTPLTKLA AT G I AG YET D 
181 EMLSKLEKW SPNGTEIDVW PYLQTLTSDA 
241 SRTIYIPGWR FLPTKRNKRM KQI FNEVRAL 
301 KEIQQHGNNK KFGMSIDEVI EECKLFYFAG 
361 QVFGSREVDY DKLNQLKIVT MILNEVLRLY 
421 TILLQHDTEI WGDDAMEFNP ERFSDGISKA 
481 MAMILKNYAF ELSPSYAHAP HPLLLQPQYG 



CAGCTTAAAA CTTACAATTT TTTCATTTGC 
AATCTTGAAT TATGTGTGGT TAAAACCAAA 
TTTCAAAGGA AACTCTTACA AATTCTTGTT 
TGAAGAAGCT ATGTCTAAGC CAATCAATTT 
GCCCTTCATC CACAAAACCA TCACCAATTA 
AAGACCAGCA GTCCTGATCA CAGACCCGGA 
CGTTTATCAG AAGCCACCTG GCACTCCACT 
CTATGAAACA GATAAATGGG CTACACATAG 
CAAGTTGAAG CATATGCTAC CTGCATTCCA 
GGAGAAAGTT GTCTCACCAA AT G GAACAGA 
AACAAGT GAT GCCATTTCAA GAACTGCTTT 
TTTTGAACTT CAAAAGGAAC AACTTTCACT 
CCCAGGATGG AGGTTTTTGC CAACAAAAAG 
AGTACGAGCG CTGGTATTGG GAATTATTAA 
AGCTCCTGAT GATTTATTGG GTATATTATT 
TGGAAATAAC AAGAAATTTG GTATGAGTAT 
CTATTTTGCG GGGCAAGAGA CAACTTCATC 
CAAACATCCT AGTTGGCAAG ATAAAGCTAG 
GGAAGTTGAC TATGACAAGT TGAATCAGCT 
CTTAAGGTTG TATCCAGCAG GATATGCGAT 
AGGGAATTTA TGTTTACCAG CTGGGGTACA 
TGATACTGAA ATATGGGGAG ATGATGCAAT 
AATATCCAAA GCAACAAAAG GAAAACTTGT 
ATGTATTGGG CAAAATTTTG CTATGTTAGA 
AAACTAT GCA TTTGAACTCT CTCCATCTTA 
ACCTCAATAT GGTGCTCAAT TAATTTTGTA 
GTTATGGAAC TTTTATCATC GTAATCAACC 



VWLKPKELEK CIRQQGFKGN SYKFLFGDMK 
KTITNYGKNC FVWFGPRPAV LITDPELVKE 
KWATHRRLLN PAFHLDKLKH MLPAFQFTAC 
ISRTAFGSSY EEGRKLFELQ KEQLSLILEV 
VLGIIKKRLS MIENGEAPDD LLGILLASNL 
QETTSSLLVW TMILLCKHPS WQDKAREEVL 
PAGYAINRMV TKETKLGNLC LPAGVQLLLP 
TKGKLVFFPF SWGPRICIGQ NFAMLEAKMA 
AQLILYKL 
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NAME D141-AD7 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 191 

1 GTCCTAACTA AAAATGGAGA TTCAGTTTTC TAACTTAGTT GCATTCTTGC TCTTTCTCTC 
61 CAGCATCTTT CTTCTATTCA AAAAATGGAA AACCAGAAAA CTAAATTTGC CTCCTGGTCC 
121 ATGGAAATTA CCTTTTATTG GAAGTTTACA CCATTTGGCT GTGGCAGGTC CACTTCCTCA 
181 CCATGGCCTA AAAAATTTAG CCAAACGCTA TGGTCCTCTT ATGCATTTAC AACTTGGACA 
241 AATTCCTACA CTCATCATAT CATCACCTCA AATGGCAAAA GAAGTACTAA AAACTCACGA 
301 CCTCGCTTTT GCCACTAGAC CAAAGCTTGT CGTGGCCGAC ATCATTCACT AC GACAGCAC 
361 GGACATAGCA TTTTCTCCGT ACGGTGAATA CTGGAGACAA ATTCGTAAAA TTTGCATATT 
421 GGAACTCTTG AGTGCCAAGA TGGTCAAATT TTTTAGCTCG ATTCGCCAAG ATGAGCTCTC 
481 GAAGATGCTC TCATCTATAC GAACGACACC CAATCTTACA GTCAATCTTA CTGACAAAAT 
541 TTTTTGGTTT ACGAGTTCGG TAACTTGTAG ATCAGCTTTA GGGAAGATAT GTGGTGACCA 
601 AGACAAATTG ATCATTTTTA TGAGGGAAAT AATATCATTG GCAGGTGGAT TTAGTATTGC 
661 TGATTTTTTC CCTACATGGA AAATGATTCA TGATATTGAT GGTTCGAAAT CTAAACTGGT 
721 GAAAGCACAT CGTAAGATTG ATGAAATTTT GGGAAATGTT GTTGATGAGC ACAAAAAGAA 
781 CAGAGCAGAT GGCAAGAAGG GTAATGGTGA ATTTGGTGGT GAAGATTTGA TTGATGTATT 
841 GTTAAGAGTT AGAGAAAGTG GAGAAGTTCA AATTCCTATC ACAAATGACA ATATCAAATC 
901 AATATTAATC GACATGTTCT CTGCGGGATC TGAAACATCA TCGACGACTA TAATTTGGGC 
961 ATTAGCTGAA AT GAT GAAGA AACCAAGTGT TTTAGCAAAG GCACAAGCTG AAGTAAGGCA 
1021 AGCTTTGAAG GAGAAAAAAG GTTTTCAACA GATTGATCTT GATGAGCTAA AATATCTCAA 
1081 GTTAGTAATC AAAGAAACCT TAAGAATGCA CCCTCCAATT CCTCTATTAG TTCCTAGAGA 
1141 ATGTATGGAG GATACAAAGA TTGATGGTTA CAATATACCT TTCAAAACAA GAGTCATAGT 
1201 TAATGCATGG GCAATCGGAC GAGATCCAGA AAGTTGGGAT GACCCCGAAA GCTTTATGCC 
1261 AGAGAGATTT GAGAATAGTT CTATTGACTT TCTTGGAAAT CATCATCAGT TTATACCATT 
1321 TGGTGCAGGA AGAAGGATTT GTCCGGGAAT GCTATTTGGT TTAGCTAATG TTGGACAACC 
1381 TTTAGCTCAG TTACTTTATC ACTTCGATTG GAAACTCCCT AATGGACAAA GTCATGAGAA 
1441 TTTCGACATG ACTGAGTCAC CTGGAATTTC TGCTACAAGA AAGGATGATC TTGTTTTGAT 
1501 TGCCACTCCT TATGATTCTT ATTAAGCAGT AGCAGAAATA AAAAGCCGGG GCAAACAGAA 
1561 AAAAGT 

SEQ. ID. NO. 192 

1 MEIQFSNLVA FLLFLSSIFL LFKKWKTRKL NLPPGPWKLP FIGSLHHLAV AGPLPHHGLK 
61 NLAKRYGPLM HLQLGQIPTL IISSPQMAKE VLKTHDLAFA TRPKLWADI IHYDSTDIAF 
121 SPYGEYWRQI RKICILELLS AKMVKFFSSI RQDELSKMLS SIRTTPNLTV NLTDKIFWFT 
181 SSVTCRSALG KICGDQDKLI IFMREIISLA GGFSIADFFP TWKMIHDIDG SKSKLVKAHR 
241 KIDEILGNW DEHKKNRADG KKGNGEFGGE DLIDVLLRVR ESGEVQIPIT NDNIKSILID 
301 MFSAGSETSS TTIIWALAEM MKKPSVLAKA QAEVRQALKE KKGFQQIDLD ELKYLKLVIK 
361 ETLRMHPPIP LLVPRECMED TKIDGYNIPF KTRVIVNAWA IGRDPESWDD PESFMPERFE 
421 NSSIDFLGNH HQFIPFGAGR RICPGMLFGL ANVGQPLAQL LYHFDWKLPN GQSHENFDMT 
481 ESPGISATRK DDLVLIATPY DSY 
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NAME D147-AD3 
ORGANISM NICOTIANA TABACUM 



SEQ. ID. NO. 193 , nnmr ^r TCCTCTCCCA AATCACTGAT TCACCACCAA AAGTACCAAC 

1 CAACTAACAA ACACATTGAG TCCTCTCCCA ™ TATTTCTTGA TACTCTGTTT 

61 AATTCAATGG AAGGTACAAA CTTGACTACA ATTTACCTCC AGGCCCAAAA 

121 CTTTTGTTCC TTTCCAAACT TCTTCGCCAG AGGAAACTCA ATTTA CTCAMCCAC 
181 CCATGGCCGA TCATCGGAAA CTTAAACCTT ATTOBttMC TT^^ CCCCGTTGTA 
241 GAACTCTCCC TCAAGTACGG ACCCGTTATG CAACT CCATGGATAT TAACTTTGTA 

301 GTTGGATCCT CCGTCGAAAT GGCTAAGATT ATTATTCCGA TATTACATGG 

361 GGCAGGCCTA AAACGGCTGC CGGAAAATAC ACAACGTACA ATTATT ATTATTCAGC 
421 TCTCCTTACG GACCATATTG GCGCCAGGCA CGTAGGATGT ^ TCTTCTCCAT 
481 ACGAAACGTC TCGATTCATA CGAGTATATT CGGGCTGAGG AGTTGCA 
541 AATT T GAACA AAATATCAGG ^AAACCAATT GTGTTGAAAG ATTA CGAGAACTCG 
601 TTAAATGTTA TTAGCAGGAT GGTACTGGGG AAAAGGTATT ^GA ^tggTGTA 
661 TTCGTGAATC CTGAGGAATT TAAGAAGATG TTGGACGAAT ^1 TTATGTTAAG 
721 CTTAATATTG GAGATTCAAT TCCATGGATT «TTTCMGG MTT TGATGAGCAT 
781 AGGATGAAAG TAGTGAGCAA GAAATTCGAC AAGlil ATATGGTGGA TGTTTTGTTG 

841 AACAT TAGGA GAAATGGAGT GGAGAATTAT ^TGCTAAGG ^ ^ ^CATTC 
901 CAGCTCGCTG ATGATCCGAA GTTGGAAGTT ^CUrf^ TGACAGTGGA GTGGGCAATT 
961 ACTCAGGATA TGCTGGCTGG TGGAACCGAG AGTTCAGCAG TGA^^ GGATCGAGTA 
1021 TCAGAGCTGC TAAAGAAGCC GGAGATTTTC ATCTTCCTTA CATAGAGGCA 

1081 ATTGGGCAGA ATAGATGGGT ACAAGAAAAG ^CA^CAA ATCT ACGTGAGTGT 
1141 ATAGT CAAAG AGACTATGCG ACTGCACCCC GTGGCACCAA ^ TCTCGTGAGT 
1201 CGAGAAGATA TTAAGGTAGC AGGCTACGAC ^TCAGAAAG ^ CAAGCCGGAG 
1261 GT AT GGACTA TTGGAAGAGA CCCTACATTG ^GGGACGAGC ^ GCCATTTGGA 
1321 AGATTCCATG AAAGGTCCAT AGATGTTAAA GGACATGATT ATGAGC AGCTAGCTTA 

\H\ K5K SSSS SSE ™ ~g C C 5KS5 
\H\ SSSES ACTTTACTCT CTTTGATTCA GCAGTTCTAT GGTTCCGTCA 

1621 AGATAGACTT TGTTACGTTT GAACCTGTGC TC 

SEQ. ID. NO. 194 „ T • „ «.„, T ROB K LNLPPGPKPW PIIGNLNLIG NLPHRSIHEL 

1 MEGTNLTTYA AVFLDTLFLL FLSKLLRQRK "J^b^r PKTAAGKYTT YNYSDITWSP 
61 SLKYGPVMQL QFGSFPVWG fSVEMAKIFL KSMDINFVGR P^AAGKYTT ™^ ^ 
121 YGPYWRQARR MCLTELFSTK RLDSYEYIRA EELHSLL^L MDLQGYVKRM 
111 VISRMVLGKR YLDESENSFV NPEEFKKMLD E™GVLN ^SIPWIDF 
241 KWSKKFDKF LEHVIDEHNI ™NG^NW A ™^£ G QNRWVQEKDI PNLPYIEAIV 
301 DMLAGGTESS AVTVEWAISE ™EIFKK ATEELDRVIG QNR g EPEVFKPERF 

III SSESl DYELLPFGAG S» ^GEWS.P DNMTPEDLNM 

481 DEIFGLSTPK KFPLATVIEP RLSPKLYSV 
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NAME D163-AF12 
ORGANISM N I COT I ANA TABACUM 

SEQ. ID. ^°^^ TCCTT ccTAACTAAA AATGGAGATT CAGTTTTCTA ACTTAGTTGC ATTCTTGCTC 
61 TTTCTCTCCA GCATCTTTCT TGTATTCAAA AAATGGAAAA CCAGAAAACT AAATTTGCCT 
121 CCTGGTCCAT GGAAATTACC TTTTATTGGA AGTTTACACC ATTTGGCTGT GGCAGGTCCA 
181 CTTCCTCACC ATGGCCTAAA AAATTTAGCC AAACGCTATG GTCCTCTTAT GCATTTACAA 
241 CTTGGACAAA TTCCTACACT CGTCATATCA TCACCTCAAA TGGCAAAAGA AGTACTAAAA 
301 ACTCACGACC TCGCTTTTGC CACTAGACCA AAGCTTGTCG TGGCCGACAT CATTCACTAC 
361 GACAGCACGG ACATAGCATT TTCGCCATAC GGTGAATACT GGAGACAAAT TCGTAAAATT 
421 TGCATATTGG AACTCTTGAG TGCCAAGATG GTCAAGTTTT TTAGCTCGAT TCGCCAAGAT 
481 GAGCTCTCGA AGATGGTTTC ATCTATACGA ACGACGCCCA ATCTTCCAGT CAATCTTACC 
541 GACAAGATTT TTTGGTTTAC GAGTTCGGTA ATTTGTAGAT CAGCTTTAGG GAAGATATGT 
601 GGTGACCAAG ACAAATTGAT CATTTTTATG AGGGAAATAA TATCATTGGC AGGTGGATTT 
661 AGTATTGCTG ATTTTTTCCC TACATGGAAA ATGATTCATG ATATTGATGG TTCAAAATCT 
721 AAACTGGTGA AGGCACATCG TAAGATTGAT GAAATTTTGG AAAATGTGGT AAATGAGCAC 
781 AAACAGAATC GAGCAGATGG TAAAAAGGGT AATGGTGAAT TTGGTGGAGA AGATCTGATT 
841 GATGTTTTGT TAAGAGTTAG AGAAAGTGGA GAAGTTCAAA TTCCAATCAC AGATGACAAT 
901 ATCAAATCAA TATTAATCGA CATGTTCTCT GCCGGATCGG AAACATCATC GACAACTATA 
961 ATTTGGGCAT TAGCTGAAAT GATGAAGAAA CCAAGTGTTT TAGCAAAGGC ACAAGCTGAA 
1021 GTGAGGCAAG CTTTGAAGGG GAAGAAAATT AGTTTTCAAG AGATTGATAT TGATAAGCTA 
1081 AAGTATTTGA AGTTAGTGAT CAAAGAAACT TTAAGAATGC ACCCTCCAAT TCCTCTGTTA 
1141 GTCCCTAGAG AATGTATGGA AGATACAAAG ATTGATGGTT ACAATATACC TTTCAAAACA 
1201 AGAGTCATTG TTAATGCATG GGCAATTGGA CGAGATCCTC AAAGTTGGGA TGATCCTGAA 
1261 AGCTTTACGC CAGAGAGATT TGAGAATAAT TCTATTGATT TTCTTGGAAA TCATCATCAA 
1321 TTTATTCCAT TTGGTGCAGG AAGAAGGATT TGTCCTGGAA TGCTATTTGG TTTAGCTAAT 
1381 GTTGGACAAC CTTTAGCTCA GTTACTTTAT CACTTCGATT GGAAACTCCC TAATGGACAA 
1441 AGT CAT GAGA ATTTCGACAT GACTGAGTCA CCTGGAATTT CTGCTACAAG AAAGGATGAT 
1501 CTTGTTTTGA TTGCCACTCC TTATGATTCT TATTAAGCAG TAGCAGAAAT AAAAAGCCGG 
1561 GGCAAACAGA AAAAAGTATT GCTGCTTCTA GGTATTTTCT ATTGGATAAA TTTCAAAATT 
1621 CATCCACAAT ATTTAGTGTT TGCTAGAGTT GGTTAGC 



1 MEIQFSNLVA FLLFLSSIFL VFKKWKTRKL NLPPGPWKLP FIGSLHHLAV AGPLPHHGLK 
61 NLAKRYGPLM HLQLGQIPTL VISSPQMAKE VLKTHDLAFA TRPKLWADI IHYDSTDIAF 
121 SPYGEYWRQI RKICILELLS AKMVKFFSSI RQDELSKMVS SIRTTPNLPV NLTDKIFWFT 
181 SSVICRSALG KICGDQDKLI IFMREIISLA GGFSIADFFP TWKMIHDIDG SKSKLVKAHR 
241 KIDEILENW NEHKQNRADG KKGNGEFGGE DLIDVLLRVR ESGEVQIPIT DDNIKSILID 
301 MFSAGSETSS TTIIWALAEM MKKPSVLAKA QAEVRQALKG KKISFQEIDI DKLKYLKLVI 
361 KETLRMHPPI PLLVPRECME DTKIDGYNIP FKTRVIVNAW AIGRDPQSWD DPESFTPERF 
421 ENNSIDFLGN HHQFIPFGAG RRICPGMLFG LANVGQPLAQ LLYHFDWKLP NGQSHENFDM 
481 TESPGISATR KDDLVLIATP YDSY 
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NAME D163-AG11 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 197 

1 CTTCTTCCTT CCTAACTAAA AATGGAGATT CAGTTTTCTA ACTTAGTTGC ATTCTTGCTC 
61 TTTCTCTCCA GCATCTTTCT TGTATTCAAA AAATGGAAAA CCAGAAAACT AAATTTGCCT 
121 CCTGGTCCAT GGAAATTACC TTTTATTGGA AGTTTACACC ATTTGGCTGT GGCAGGTCCA 
181 CTTCCTCACC ATGGCCTAAA AAATTTAGCC AAACGCTATG GTCCTCTTAT GCATTTACAA 
241 CTTGGACAAA TTCCTACACT CGTCATATCA TCACCTCAAA TGGCAAAAGA AGTACTAAAA 
301 ACTCACGACC TCGCTTTTGC CACTAGACCA AAGCTTGTCG TGGCCGACAT CATTCACTAC 
361 GACAGCACGG ACATAGCACT TTCGCCATAC GGTGAATACT GGAGACAAAT TCGTAAAATT 
421 TGCATATTGG AACTCTTGAG TGCCAAGATG GTCAAGTTTT TTAGCTCGAT TCGCCAAGAT 
481 GAGCTCTCGA AGATGGTTTC ATCTATACGA ACGACGCCCA ATCTTCCAGT CAATCTTACC 
541 GACAAGATTT TTTGGTTTAC GAGTTCGGTA ATTTGTAGAT CAGCTTTAGG GAAGATATGT 
601 GGTGACCAAG ACAAATTGAT CATTTTTATG AGGGAAATAA TATCATTGGC AGGTGGATTT 
661 AGTATTGCTG ATTTTTTCCC TACATGGAAA ATGATTCATG ATATTGATGG TTCAAAATCT 
721 AAACTGGTGA AGGCACATCG TAAGATTGAT GAAATTTTGG AAAATGTGGT AAATGAGCAC 
781 AAACAGAATC GAGCAGATGG TAAAAAGGGT AATGGTGAAT TTGGTGGAGA AGATCTGATT 
841 GATGTTTTGT TAAGAGTTAG AGAAAGT GGA GAAGTTCAAA TTCCAATCAC AGATGACAAT 
901 ATCAAATCAA TATTAATCGA CATGTTCTCT GCCGGATCGG AAACATCATC GACAACTATA 
961 ATTTGGGCAT TAGCTGAAAT GATGAAGAAA CCAAGTGTTT TAGCAAAGGC ACAAGCTGAA 
1021 GTGAGCCAAG CTTTGAAGGG GAAGAAAATT AGTTTTCAAG AGATTGATAT TGATAAGCTA 
1081 AAGTATTTGA AGTTAGTGAT CAAAGAAACT TTAAGAATGC ACCCTCCAAT TCCTCTGTTA 
1141 GTCCCTAGAG AATGTATGGA AGATACAAAG ATTGATGGTT ACAATATACC TTTCAAAACA 
1201 AGAGTCATTG TTAATGCATG GGCAATTGGA CGAGATCCTC AAAGTTGGGA TGATCCTGAA 
1261 AGCTTTACGC CAGAGAGATT TGAGAATAAT TCTATTGATT TTCTTGGAAA TCATCATCAA 
1321 TTTATTCCAT TTGGTGCAGG AAGAAGGATT TGTCCTGGAA TGCTATTTGG TTTAGCTAAT 
1381 GTTGGACAAC CTTTAGCTCA GTTACTTTAT CACTTCGATT GGAAACTCCC TAAT GGACAA 
1441 ACTCACCAAA ATTTCGACAT GACTGAGTCA CCTGGAATTT CTGCTACAAG AAAGGATGAT 
1501 CTTATTTTGA TTGCCACTCC TGCTCATTCT TGATTAAGTA TTGCTGCTTT TCTATTGGAG 
1561 AATTTTCAAA ATTCATCCAC AATATATAGT GTTTGCTAGA GTTGGTTAGC 

SEQ. ID. NO. 198 

1 MEIQFSNLVA FLLFLSSIFL VFKKWKTRKL NLPPGPWKLP FIGSLHHLAV AGPLPHHGLK 
61 NLAKRYGPLM HLQLGQIPTL VISSPQMAKE VLKTHDLAFA TRPKLWADI IHYDSTDIAL 
121 SPYGEYWRQI RKICILELLS AKMVKFFSSI RQDELSKMVS SIRTTPNLPV NLTDKIFWFT 
181 SSVICRSALG KICGDQDKLI IFMREIISLA GGFSIADFFP TWKMIHDIDG SKSKLVKAHR 
241 KIDEILENW NEHKQNRADG KKGNGEFGGE DLIDVLLRVR ESGEVQIPIT DDNIKSILID 
301 MFSAGSETSS TTIIWALAEM MKKPSVLAKA QAEVSQALKG KKISFQEIDI DKLKYLKLVI 
361 KETLRMHPPI PLLVPRECME DTKIDGYNIP FKTRVIVNAW AIGRDPQSWD DPESFTPERF 
421 ENNSIDFLGN HHQFIPFGAG RRICPGMLFG LANVGQPLAQ LLYHFDWKLP NGQTHQNFDM 
481 TESPGISATR KDDLILIATP AHS 
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NAME D163-AG12 
ORGANISM NICOTIANA 
SEQ. ID. NO. 199 

1 ATCCTTCTTC 
61 ATTCTTGCTC 
121 AAATTTGCCT 
181 GGCAGGTCCA 
241 GCATTTACAA 
301 AGTACTAAAA 
361 CATTCACTAC 
421 TCGTAAAATT 
481 TCGCCAAGAT 
541 CAATCTTACT 
601 GAAGAT AT GT 
661 AGGTGGATTT 
721 TTCGAAATCT 
781 TGATGAGCAC 
841 AGATTTGATT 
901 AAATGACAAT 
961 GACGACTATA 
1021 ACAAGCTGAA 
1081 TGAGCTAAAA 
1141 TCTATTAGTT 
1201 CAAAACAAGA 
1261 CCCCGAAAGC 
1321 TCATCAGTTT 
1381 AGCTAATGTT 
1441 TGGACAAAGT 
1501 GGATGATCTT 
1561 AAGCCGGGGC 
1621 CAAAATTCAT 

SEQ. ID. NO. 200 

1 MEIQFSNLVA 
61 NLAKRYGPLM 
121 SPYGEYWRQI 
181 SSVTCRSALG 
241 KIDEILGNW 
301 MFSAGSETSS 
361 ETLRMHPPIP 
421 NSSIDFLGNH 
481 ESPGISATRK 



TABACUM 



CTTCCTAGGT 
TTTCTCTCCA 
CCTGGTCCAT 
CTTCCTCACC 
CTTGGACAAA 
ACTCACGACC 
GACAGCACGG 
TGCATATTGG 
GAGCTCTCGA 
GACAAAATTT 
GGTGACCAAG 
AGTATTGCTG 
AAACTGGTGA 
AAAAAGAACA 
GATGTATTGT 
ATCAAATCAA 
ATTTGGGCAT 
GTAAGGCAAG 
TATCTCAAGT 
CCTAGAGAAT 
GTCATAGTTA 
TTTATGCCAG 
ATACCATTTG 
GGACAACCTT 
CATGAGAATT 
GTTTTGATTG 
AAACAGAAAA 
CCACAATATT 



CCTAACTAAA 
GCATCTTTCT 
GGAAATTACC 
ATGGCCTAAA 
TTCCTACACT 
TCGCTTTTGC 
ACATAGCATT 
AACTCTTGAG 
AGATGCTCTC 
TTTGGTTTAC 
ACAAATTGAT 
ATTTTTTCCC 
AAGCACATCG 
GAGCAGATGG 
TAAGAGTTAG 
TATTAATCGA 
TAGCTGAAAT 
CTTTGAAGGA 
TAGTAATCAA 
GTATGGAGGA 
ATGCATGGGC 
AGAGATTTGA 
GTGCAGGAAG 
TAGCTCAGTT 
TCGACATGAC 
CCACTCCTTA 
AAGTATTGCT 
TAGTGTTTGC 



AATGGAGATT 
TCTATTCAAA 
TTTTATTGGA 
AAATTTAGCC 
CAT CAT AT CA 
CACTAGACCA 
TTCTCCGTAC 
TGCCAAGATG 
ATCTATACGA 
GAGTTCGGTA 
CATTTT TAT G 
TACATGGAAA 
TAAGATTGAT 
CAAGAAGGGT 
AGAAAGTGGA 
CATGTTCTCT 
GATGAAGAAA 
GAAAAAAGGT 
AGAAACCTTA 
TACAAAGATT 
AATCGGACGA 
GAATAGTTCT 
AAGGATTTGT 
ACTTTATCAC 
TGAGTCACCT 
TGATTCTTAT 
GCTTCTAGGT 
TAGAGTTGGT 



CAGTTTTCTA 
AAATGGAAAA 
AGTTTACACC 
AAACGCTATG 
TCACCTCAAA 
AAGCTTGTCG 
GGTGAATACT 
GTCAAATTTT 
ACGACACCCA 
ACTTGTAGAT 
AGGGAAATAA 
ATGATTCATG 
GAAATTTTGG 
AATGGTGAAT 
GAAGTTCAAA 
GCGGGATCTG 
CCAAGTGTTT 
TTTCAACAGA 
AGAATGCACC 
GATGGTTACA 
GATCCAGAAA 
ATTGACTTTC 
CCGGGAATGC 
TTCGATTGGA 
GGAATTTCTG 
TAAGCAGTAG 
ATTTTCTATT 
TAGC 



ACTTAGTTGC 
CCAGAAAACT 
ATTTGGCTGT 
GTCCTCTTAT 
TGGCAAAAGA 
TGGCCGACAT 
GGAGACAAAT 
TTAGCTCGAT 
ATCTTACAGT 
CAGCTTTAGG 
TATCATTGGC 
ATATTGATGG 
GAAATGTTGT 
TTGGTGGTGA 
TTCCTATCAC 
AAACATCATC 
TAGCAAAGGC 
TTGATCTTGA 
CTCCAATTCC 
ATATACCTTT 
GTTGGGATGA 
TTGGAAATCA 
TATTTGGTTT 
AACTCCCTAA 
CTACAAGAAA 
CAGAAATAAA 
GGATAAATTT 



FLLFLSSIFL 
HLQLGQIPTL 
RKICILELLS 
KICGDQDKLI 
DEHKKNRADG 
TTIIWALAEM 
LLVPRECMED 
HQFIPFGAGR 
DDLVLIATPY 



LFKKWKTRKL 
IISSPQMAKE 
AKMVKFFSSI 
IFMREIISLA 
KKGNGEFGGE 
MKKPSVLAKA 
TKIDGYNIPF 
RICPGMLFGL 
DSY 



NLPPGPWKLP 
VLKTHDLAFA 
RQDELSKMLS 
GGFSIADFFP 
DLIDVLLRVR 
QAEVRQALKE 
KTRVIVNAWA 
ANVGQPLAQL 



FIGSLHHLAV 
TRPKLWADI 
SIRTTPNLTV 
TWKMIHDIDG 
ESGEVQIPIT 
KKGFQQIDLD 
IGRDPESWDD 
LYHFDWKLPN 



AGPLPHHGLK 
IHYDSTDIAF 
NLTDKIFWFT 
SKSKLVKAHR 
NDNIKSILID 
ELKYLKLVIK 
PESFMPERFE 
GQSHENFDMT 
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NAME D205-BG9 
ORGANISM N I COT I AN A TABACUM 

SEQ. ID. ^° < J, T ^^ ATTTT q^tTCAACCA TGGAGAACCA ATACTCCTAC TCATTCTCTT CCTACTTCTA 
61 CTTAGCTATA GTACTGTTTC TTCTTCCAAT TTTGGTCAAA TATTTCTTCC AT C GGAGAAG 
121 AAATTTACCT CCAAGTCCAT TTTCTCTTCC AATAATTGGT CACCTTTACC TTCTCAAGAA 
181 AACTCTCCAT CTCACTCTAA CATCCTTATC AGCTAAATAT GGTCCTGTTT TATACCTCAA 
241 ATTGGGCTCT ATGCCTGTGA TTGTTGTGTC CTCACCATCT GCTGTTGAAG AATGTTTAAC 
301 CAAGAATGAT AT CAT ATT CG CAAATAGGCC CAAGACCGTG GCTGGTGACA AGTTTACCTA 
361 CAATTATACT GTTTATGTTT GGGCACCCTA TGGCCAACTT TGGAGAATTC TTCGCCGATT 
421 AACTGTCGTT GAACTCTTCT CTTCACATAG CCTACAGAAA ACTTCTATCC TTAGAGATCA 
481 AGAAGTTGCA ATATTTATCC GTTCGTTATA CAAATTCTCA AAGGATAGTA GCAAAAAAGT 
541 CGATTTGACC AACTGGTCTT TTACTTTGGT TTTCAATCTT ATGACCAAAA TTATTGCTGG 
601 GAGACATATT GTGAAGGAGG AAGATGCTGG CAAGGAAAAG GGCATTGAAA TTATTGAAAA 
661 ACTTAGAGGG ACTTTCTTAG TAACTACATC ATTCTTGAAT ATGTGTGATT TCTTGCCAGT 
721 ATTCAGGTGG GTTGGTTACA AAGGGCTGGA GAAGAAGATG GCCTCAATTC ACAATAGAAG 
781 AAATGAATTC TTGAACAGCT TGCTTGATGA ATTTCGACAC AAGAAAAGTA GTGCTTCACA 
841 ATCTAACACA ACTGTTGGAA ACATGGAGAA GAAAACCACA CTGATTGAAA AGCTCTTGTC 
901 TCTTCAAGAA TCAGAGCCTG AATTCTACAC TGATGATATC ATCAAAAGTA TTATGCTGGT 
961 AGTTTTTGTT GCAGGAACAG AGACCTCATC AACAACCATC CAATGGGTAA TGAGGCTTCT 
1021 TGTAGCTCAC CCTGAGGCAT TGTATAAGCT ACGAGCTGAC ATTGACAGTA AAGTTGGGAA 
1081 TAAGCGCTTG CTGAATGAAT CAGACCTCAA CAAGCTTCCG TATTTGCATT GTGTTGTTAA 
1141 TGAGACAATG AGATTATACA CTCCGATACC ACTTTTATTG CCTCATTATT CAACTAAAGA 
1201 TTGTATTGTG GAAGGATATG ATGTACCAAA ACATACAATG TTGTTTGTCA ACGCTTGGGC 
1261 CATTCACAGG GATCCCAAGG TATGGGAGGA GCCTGACAAG TTCAAGCCAG AGAGATTTGA 
1321 GGCAACAGAA GGGGAAACAG AAAGGTTCAA TTACAAGCTT GTACCATTTG GAATGGGGAG 
1381 AAGAGCGTGC CCTGGAGCTG ATATGGGGTT GCGAGCAGTT TCTTTGGCAT TAGGTGCACT 
1441 TATTCAATGC TTTGACTGGC AAATTGAGGA AGCGGAAAGC TTGGAGGAAA GCTATAATTC 
1501 TAGAATGACT ATGCAGAACA AGCCTTTGAA GGTTGTCTGC ACTCCACGCG AAGATCTTGG 
1561 CCAGCTTCTA TCCCAACTCT AAGGCAATTT ATCAATGCCA AACGTAATCT TCATCTACCA 
1621 CTATG 

SEQ. ID. ^*£ gYgFS SYFYLAIVLF LLPILVKYFF HRRRNLPPSP FSLPIIGHLY LLKKTLHLTL 
61 TSLSAKYGPV LYLKLGSMPV IWSSPSAVE ECLTKNDIIF ANRPKTVAGD KFTYNYTVYV 
121 WAPYGQLWRI LRRLTWELF SSHSLQKTSI LRDQEVAIFI RSLYKFSKDS SKKVDLTNWS 
181 FTLVFNLMTK IIAGRHIVKE EDAGKEKGIE IIEKLRGTFL VTTSFLNMCD FLPVFRWVGY 
241 KGLEKKMAS I HNRRNEFLNS LLDEFRHKKS SASQSNTTVG NMEKKTTLIE KLLSLQESEP 
301 EFYTDDIIKS IMLWFVAGT ETSSTTIQWV MRLLVAHPEA LYKLRADIDS KVGNKRLLNE 
361 SDLNKLPYLH CWNETMRLY TPIPLLLPHY STKDCIVEGY DVPKHTMLFV NAWAIHRDPK 
421 VWEEPDKFKP ERFEATEGET ERFNYKLVPF GMGRRACPGA DMGLRAVSLA LGALIQCFDW 
481 QIEEAESLEE SYNSRMTMQN KPLKWCTPR EDLGQLLSQL 



FIG. 102 



NAME D207-AA5 
ORGANISM N I COT I ANA TABACUM 

SE Q. ID. NO- 20^ CCTTTTCTTA CTTAGTAAAA TGGATATTCA GTCTTCTCCT TTCAACTTAA 

XgctTTGCT ACTCTTCATT TCATTTCTTT TTATCCTATT GAAAAAGTGG AATACCAAAA 
J "ccIHgtt acctccaggt ccatggagac TTCCCCTTAT TGGCAGCCTC CATCACTTGA 
\l\ Hggtaaact cccacaccat catcttagag atttagcccg aaaatatgga cctctcatgt 

241 ATTTACAACT TGGAGAAGTT CCTGTAGTTG TAATATCTTC GCCACGTATA GCAAAAGCTG 
301 TACTAAAAAC TCATGATCTT GCTTTTGCAA CGAGGCCTCG GTTCATGTCC TCGGACATTG 
111 TGTTTTACAA AAGCAGGGAC ATATCATTCG CCCCATATGG CGATTACTGG AGACAAATGC 
421 GTAAAATATT AACACAAGAA CTCTTGAGTA ACAAGATGCT CAAGTCATTT AGCACAATCC 

5" SSI S£aag ctcctctcgt cgattcgttt agcaacagct tcttctgcag 

III TGAACATAAA CGAAAAGCTT CTCTGGTTTA CAAGTTGCAT GACTTGTAGA TTAGCCTTTG 



601 1 = ' ^ ; ,™ ^™ ™ ™™ 

III SSSSS ESSE CTGAGGGTTA A^^ T^GCTTCAA TTTCCTATCG 



661 CAGGAGGATT TGATGTGTGT GATTTGTTCC CTTCATGGAA ATTACTTCAC AATATGAGCA 
721 ™ tS^cg AATGTTCACC ATAAGTATAA TCTAATTATG gagaatatca 
tSatgagca caaagagaat catgcagcag ggataaaggg aaataacgag tttggtggcg 



££££££ GTWTCTGG ACTTGTTTAT TGCTGGAACT OAAACTTCAT 
961 ATACTGCAAT TATATGGGCA CTATCAGAAT TGATGAAGCA CCCAAGTGTT ATGGCCAAGG 

1021 cSgctga agtgagaaaa gtcttcaaag aaaatgaaaa cttggacgaa aatgatcttg 

llll ACAAGTTGCC ATACTTAAAA TCAGTGATCA AAGAAACACT AAGGATGCAT CCTCCAGTTC 
llll CTTTATTAGG ACCTAGAGAA TGCAGAGAAC AAACTGAGAT TGATGGATAT ACTGTACCTC 
1201 TTAAAGCTAG AGTAATGGTT AATGCATGGG CAATTGGAAG AGATCCTGAA AGTTGGGAAG 
1261 ATCCTGAAAG TTTCAAACCC GAGCGATTTG AAAATATTTC TGTTGATCTT ACGGGAAATC 
llll ACTATCAGTT CATCCCTTTC GGTTCAGGAA GAAGAATGTG TCCAGGAATG TCGTTTGGTT 
IfA TAGTTAACAC TGGGCATCCT TTAGCTCAGT TGCTCTATTT CTTTGACTGG AAATTCCCTC 
llll ItSSa£ TGCAGCTGAT TTTCACACTA CTGAAACAAG TAGAGTTTTT gcagcaagca 
1501 AAGATGACCT CTACTTGATT CCAACAAATC ACATGGAGCA AGAGTAGCTC TAAATTGAAT 
llll TCTTGTCTTG GAACAATAAA AGAAGAAACT CCAGCTTGGT CTACATTATT TCTTTTTGCT 
1621 TTATATTAGT ATGGGTGTGT TCAGTTTCTT ATTTTTAAGG GTACCCTGAA AGATAAAGGG 
llll CTATATAAAC CAGTGAGACT TTTTATTGGT TGCAAGGTTT TAGATCAAGC CATAAGACAG 
1741 CATATTTTAT TCAAAAAAAA AAAAAAA 

SEQ. ID. NO^ 2" IALLLFISFL FILLKKWNTK IPKLPPGPWR LPLIGSLHHL kgklphhhlr 

61 DLARKYGPLM YLQLGEVPW VISSPRIAKA VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 
121 APYGDYWRQM RKILTQELLS NKMLKSFSTI RKDELSKLLS SIRLATASSA VNINEKLLWF 
ill TSCMTCRLAF GKICNDRDEL IMLIREILAL SGGFDVCDLF PSWKLLHNMS NMKARLTNVH 
III SS INEHKENHAA GI KGNNEFGG EDMIDALLRV KENNELQFPI ENDNMKAVIL 
301 SEES YTAIIWALSE LMKHPSVMAK AQAEVRKVFK ENENLDENDL DKLPYLKSVI 
361 KETLRMHPPV PLLGPRECRE QTEIDGYTVP LKARVMVNAW AIGRDPESWE DPESFKPERF 
421 ENISVDLTGN HYQFI PFGSG RRMCPGMSFG LVNTGHPLAQ LLYFFDWKFP HKVNAADFHT 
481 TETSRVFAAS KDDLYLIPTN HMEQE 



FIG. 103 



NAME D207-AB4 
ORGANISM N I COT I AN A TABACUM 

SEQ. ID. NO 205 CTTAGTAAAA TGGATATTCA GTCTTCTCCT TTCAACTTAA 

61 WGC^TGCT ACTCTTCATT TCATTTCTTT TTATCCTATT GAAAAAGTGG AATACCAAAA 
191 tccSIagtt ACCTCCAGGT CCATGGAGAC TTCCCCTTAT TGGCAGCCTC CATCACTTGA 
III IS^ACT CcScA^CAT CATCTTAGAG ATTTAGCCCG AAAATATGGA CCTCTCATGT 
241 MTTACAACT TGGAGAAGTT CCTGTAGTTG TAATATCTTC GCGACGTATA GCAAAAGCTG 
301 tSaa^ac TCATGATCTT GCTTTTGCAA CGAGGCCTCG GTTCATGTCC tcggacattg 
III TGTTTTACAA AAGCAGGGAC ATATCATTCG CCCCATATGG CGATTACTGG AGACAAATGC 
III gtaaaItSt AACACAAGAA CTCTTGAGTA ACAAGATGCT caagtcattt agcacaatcc 
ill rl^TTATGG GCTCTCGAAG CTCCTCTCGT CGATTCGTTT AGCAACAGCT TCTTCTGCAG 
SSS? CTCTGGTTTA CAAGTTGCAT GACTTGTAGA TTAGCCTTTG 

HI CAGGAGGATT ££££ SSSSS SESSSZ " 

721 »" AATGTTCACC ATAAGTATAA TCTAATTATG GAGAATATCA 

7fll tSatgagca CAAAGAGAAT CATGCAGCAG ggataaaggg aaataacgag tttggtggcg 

III AATATATGAT TGATGCTTTA CTGAGGGTTA AGGAGAATAA TGAGCTTCAA TTTCCTATCG 
Hi aSS ^gaaagcX GTAATTCTGG ACTTGTTTAT TGCTGGAACT GAAACTTCAT 

III a^tS Stmgggca ctatcagaat tgatgaagca cccaagtgtt atggccaagg 
in2i SSa agtgagaaaa gtcttcaaag aaaatgaaaa cttggacgaa aatgatcttg 

llll ACAAGTTGCC ATACTTAAAA TCAGTGATCA AAGAAACACT AAGGATGCAT CCTCCAGTTC 
llll cmATTAGG ACCTAGAGAA TGCAGAGAAC AAACTGAGAT TGATGGATAT ACTGTACCTC 
Hoi TTAAAGCTAG AGTAATGGTT AATGCATGGG CAATTGGAAG AGATCCTGAA AGTTGGGAAG 

llll Itc^gaaag tttcaaaccc gagcgatttg aaaatatttc tgttgatctt acgggaaatc 

lltl ACTATCAGTT CATTCCTTTC GGTTCAGGAA GAAGAATGTG TCCAGGAATG TCGTTTGGTT 
llll TAGTTAACAC TGGGCATCCT TTAGCTCAGT TGCTCTATTT CTTTGACTGG AAATTCCCTC 
l£l ATAAGGTTAA TGCAGCTGAT TTTCACACTA CTGAAACAAG TAGAGTTTTT GCAGCAAGCA 
1 tni AAGATGACCT CTACTTGATT CCAACAAATC ACATGGAGCA AGAGTAGCTC TAAATTGAAT 
1561 TCTTGTCTTG GAACGATAAA AGAAGAAACT CCAGCTTGGT CTACATTATT TCTTTTTGCT 
lltl TTATATTAGT ATGGGTGTGT TCAGTTTCTT GTTTTTAAGG GTACCCTGAA AGATAAAGGG 
1681 CTATATAAAC CAGTGAGACT TTTTATTGAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

SEQ. ID. NO. 206 IALLLFISFL FILLKKWNTK IPKLPPGPWR LPLIGSLHHL KGKLPHHHLR 

61 YLQLGEVPW VISSPRIAKA VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 

121 APYGDYWRQM RKILTQELLS NKMLKSFSTI RKDELSKLLS SIRLATASSA VMINEKLLWF 

III tIcotcIlaf gkicndrdel imlireiial sggfdvcdlf pswkllhnms nmkarltnvh 
241 h™ inehkenhaa gikgnnefgg edmidallrv kennelqfpi ™kavil 

101 DLFIAGTETS YTAIIWALSE LMKHPSVMAK AQAEVRKVFK ENENLDENDL DKLPYLKSVI 
III KETLP^PPV PLLGPRECRE QTEIDGYTVP LKARVMVNAW AIGRDPESWE DPESFKPERF 

421 enisZtgn hyqf?pfgsg rrmcpgmsfg lvntghplaq llylfdwkfp hkvnaadfht 

481 TETSRVFAAS KDDLYLIPTN HMEQE 
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NAME D207-AC4 
ORGANISM NICOTIANA TABACUM 

SEQ. ID. ^°^^ ACCTT CCTTTTCTTA CTTAGTAAAA TGGATATTCA GTCTTCTCCT TTCAACTTAA 
61 TTGCTTTGCT ACTCTTCATT TCATTTCTTT TTATCCTATT GAAAAAGTGG AATACCAAAA 
121 TCCCAAAGTT ACCTCCAGGT CCATGGAGAC TTCCCCTTAT TGGCAGCCTC CATCACTTGA 
181 AAGGTAAACT CCCACACCAT CATCTTAGAG ATTTAGCCCG AAAATATGGA CCTCTCATGT 
241 ATTTACAACT TGGAGAAGTT CCTGTAGTTG TAATATCTTC GCCACGTATA GCAAAAGCTG 
301 TACTAAAAAC TCATGATCTT GCTTTTGCAA CGAGGCCTCG GTTCATGTCC TCGGACATTG 
361 TGTTTTACAA AAGCAGGGAC ATATCATTCG CCCCATATGG CGATTACTGG AGAGAAATGC 
421 GTAAAATATT AACACAAGAA CTCTTGAGTA ACAAGATGCT CAAGTCATTT AGCACAATCC 
481 GAAAGGATGA GCTCTCGAAG CTCCTCTCGT CGATTCGTTT AGCAACAGCT TCTTCTGCAG 
541 TGAACATAAA CGAAAAGCTT CTCTGGTTTA CAAGTTGCAT GACTTGTAGA TTAGCCTTTG 
601 GAAAAATATG CAACGATCGT GATGAATTGA TTATGTTAAT AAGGGAGATA TTAGCATTAT 
661 CAGGAGGATT TGATGTGTGT GATTTGTTCC CTTCATGGAA ATTACTTCAC AAT AT GAGCA 
721 ACAT GAAAGC TAGATTGACG AATGTTCACC ATAAGTATAA TCTAATTATG GAGAAT AT CA 
781 TCAATGAGCA CAAAGAGAAT CATGCAGCAG GGATAAAGGG AAATAACGAG TTTGGTGGCG 
841 AAGATATGAT TGATGCTTTA CTGAGGGTTA AGGAGAATAA TGAGCTTCAA TTTCCTATCG 
901 AAAATGACAA CATGAAAGCA GTAATTCTGG ACTTGTTTAT TGCTGGAACT GAAACTTCAT 
961 ATACT GCAAT TATATGGGCA CTATCAGAAT T GAT GAAGCA CCCAAGTGTT ATGGCCAAGG 
1021 CACAAGCTGA AGT GAGAAAA GTCTTCAAAG AAAATGAAAA CTTGGACGAA AATGATCTTG 
1081 ACAAGTTGCC ATACT T AAAA TCAGTGATCA AAGAAACACT AAGGATGCAT CCTCCAGTTC 
1141 CTTTATTAGG ACCTAGAGAA TGCAGAGAAC AAACTGAGAT TGATGGATAT ACTGTACCTC 
1201 TTAAAGCTAG AGTAATGGTT AATGCATGGG CAATTGGAAG AGATCCTGAA AGTTGGGAAG 
1261 ATCCTGAAAG TTTCAAACCC GAGCGATTTG AAAATATTTC TGTTGATCTT ACGGGAAATC 
1321 ACTATCAGTT CATTCCTTTC GGTTCAGGAA GAAGAATGTG TCCAGGAATG TCGTTTGGTT 
1381 TAGTTAACAC TGGGCATCCT TTAGCTCAGT TGCTCTATCT CTTTGACTGG AAATTCCCTC 
1441 ATAAGGTTAA TGCAGCTGAT TTTCACACTA CTGAAACAAG TAGAGTTTTT GCAGCAAGCA 
1501 AAGATGACCT CTACTTGATT CCAACAAATC ACATGGAGCA AGAGTAGCTC TAAATTGAAT 
1561 TCTTGTCTTG GAACAATAAA AGAAGAAACT CCAGCTTGGT CTACATTATT TCCTTTTGCT 
1621 TTATATTAGT ATGGGTGTGT TCAGTCTCTT GTTTTTAAGG GTACCCTGAA AGATAAAGGG 
1681 CTATATAAAC CAGTGAGACT TTTTATTGGT TGCAAGGTTT TAGATCAAGC CATAAGACAG 
1741 CATATTTTAT TCCACCATTT TCTATCATGT TTAATAAAGT TCCTTTCGTT TATTGTTAGA 
1801 AAAAAAAAAA AAAAAAAAAA AAA 



: ♦ IN \J • ^UO 

1 MDIQSSPFNL IALLLFISFL FILLKKWNTK IPKLPPGPWR LPLIGSLHHL KGKLPHHHLR 
61 DLARKYGPLM YLQLGEVPW VISSPRIAKA VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 
121 APYGDYWRQM RKILTQELLS NKMLKSFSTI RKDELSKLLS SIRLATASSA VNINEKLLWF 
181 TSCMTCRLAF GKICNDRDEL IMLIREILAL SGGFDVCDLF PSWKLLHNMS NMKARLTNVH 
241 HKYNLIMENI INEHKENHAA GIKGNNEFGG EDMIDALLRV KENNELQFPI ENDNMKAVIL 
301 DLFIAGTETS YTAIIWALSE LMKHPSVMAK AQAEVRKVFK ENENLDENDL DKLPYLKSVI 
361 KETLRMHPPV PLLGPRECRE QTEIDGYTVP LKARVMVNAW AIGRDPESWE DPESFKPERF 
421 ENISVDLTGN HYQFIPFGSG RRMCPGMSFG LVNTGHPLAQ LLYLFDWKFP HKVNAADFHT 
481 TETSRVFAAS KDDLYLIPTN HMEQE 
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NAME D209-AA10 
ORGANISM N I COT I ANA TABACUM 



SEQ. ID. NO. 209 TAACCAAAAT GCAGTTCTTC AGCCTGGTTT 

1 ATATGCAACT GAGATTTGAA GAATACCAAC TAACCAAAAT TccftATAGCC 

61 CCATTTTCCT ATTTCTATCT "TCCT CTT1 1 ACTAGGAAGT ATGCTTCATA 

121 AAAGCAAAAA GTTGCCACCA GGTCCATGGA AACWCCAAT GGACCACTTA 
181 TGGTTGGTGG ACTACCACAC CATGTCCTTA «^TTAGC ACGGCAAAAG 
241 TGCACCTTCA ATTAGGTGAA GTTTCTGCGG TTGTGGTTAC ^ GCCCCGGAGA 
301 AAGTATTAAA AACTCATGAC ATCGCTTTTG CGTCTAGGCC TAGCCTTTT 
361 TTGTCTGTTA CAATAGGTCT GATCTAGCCT TTTGCCCCTA TTTAGCTCTA 
421 TGCGTAAAAT ATGTGTCTTG GAAGTGCTCA GTGCCAAGAA TGTTCGGACA ™^ 
481 TTAGGCGGAA TGAAGTTCTT CGTCTCATTA ATTTTATCCG GTCATCTT TCAGCGTTTG 
541 TTAATGTTAC GGAAAGGATC TTTTTGTTCA CAAGCTCCAT ^AT^ MACTCTTAG 
601 GGCAAGTGTT CAAAGAGCAA GACAAATTTA TACAAOiAAi GTGCTCAGTG 
661 CAGGAGGGTT TGATGTGGCT GACATATTCC CTTCACTGAA ™TCAT 
721 GAATGAAGGG TAAGATTATG AATGCACACC ATAAGGTAGA ^ GGAGGTGAAG 
781 TCAATGAGCA CAAGAAAAAT CTTGCAATTG GGAAAACTAA ™ CCT ATCACCA 
841 ATTTAATTGA TGTTCTTCTA AGACTTATGA ATGATGGAGG CCTTCAA^ acttcatcgt 
901 ACGACAACAT CAAAGCTATA ATTTTTGACA £111^ AGCCGTATTC GCGAAAGCTC 
961 CAACAATTGT GTGGGCTATG GTAGAAATGG JGAAAAATCC AGCC ^nGGMG 
1021 AAGCAGAAGT AAGAGAAGCA TTTAGAGGAA AAGAAACTTT £GAT CCGGTTC CAC 
1081 AGCTAAACTA CCTAAAGTTA GTAATAAAAG AAACTCTAAG ACTT attcctgtaa 
1141 TTTTGCTCCC AAGAGAATGT AGGGAAGAGA CAAATAlAftA TrcAAAATAT TGGAATGACG 
1201 AGACCAAAGT CATGGTTAAT GTTTGGGCTT ^GGAAGAGA TCCAAAATA „ 
1261 CAGAAACTTT TATGCCAGAG AGATTTGAGC AGTGCTCTAA ^™ TTTGGCTTAG 
1321 TTGAATATCT TCCATTTGGT GGCGC^AGGA GGATTTGTCC CTCCCTGCTG 
1381 CTAATGCTTA TTTGCCATTG GCTCAATTAC ^AlAlUft AGTAACTGCC GCTAGAAAAA 
1441 GAATCGAACC AAGCGACTTG GACTTGACTG AGTTGGTTGG AGTAACT AATGGTTTCA 
1501 GTGACCTTTA CTTGGTTGCG ACTCCTTATC AACCTCCTCA AAAGTGA TTTTCGGTTT 
iSi SSS SSSSS S TATAGGAATG TACCATGTTG TCG 

SEC - ID " i°mqlrfeeyql t = vs xe, = = ns Q sk = k ,p = 

£ ^SSSSS S5EKS *= SSKS ESSE 

181 NVTERIFLFT SSMTCRSAFG QVFKEQDKFI LIDVLLRLMN DGGLQFPITN 

loi DN = SKSS = — SS5SS 

ffi SSSSS CSS SS5K SEE ESS* VHF.WK.PAG 
481 IEPSDLDLTE LVGVTAARKS DLYLVATPYQ PPQK 
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NAME D209-AA12 
ORGANISM N I COT I ANA TABACUM 

SEQ. ID. N0^211^ ct GAGATTTGAA GAATACCAAC TAACCAAAAT GCAGTTCTTC AGCTTGGTTT 
61 CCATTTTCCT ATTTCTATCT TTCCTCTTTT TGTTAAGGAT ATGGAAGAAC TCCAATAGCC 
121 AAAGCAAAAA GTTGCCACCA GGTCCATGGA AACTACCAAT ACTAGGAAGT ATGCTTCATA 
181 TGGTTGGTGG ACTACCACAC CATGTCCTTA GAGATTTAGC CAAAAAATAT GGACCACTTA 
241 TGCACCTTCA ATTAGGTGAA GTTTCTGCGG TTGTGGTTAC TTCTCCTGAT ACGGCAAAAG 
301 AAGTATTAAA AACTCATGAC ATCGCTTTTG CGTCTAGGCC TAGCCTTTTG GCCCCGGAGA 
361 TTGTCTGTTA CAATAGGTCT GATCTAGCCT TTTGCCCCTA TGGCGACTAT TGGAGACAAA 
421 TGCGTAAAAT ATGTGTCTTG GAAGTGCTCA GTGCCAAGAA TGTTCGGACA TTTAGCTCTA 
481 TTAGGCGGAA TGAAGTTCTT CGTCTCATTA ATTTTATCCG GTCATCTTCT GGTGAACCTA 
541 TTAATGTTAC GGAAAGGATC TTTTTGTTCA CAAGCTCCAT GACATGTAGA TCAGCGTTTG 
601 GGCAAGTGTT CAAAGAGCAA GACAAATTTA TACAACTAAT TAAAGAAGTG ATACTCTTAG 
661 CAGGAGGGTT TGATGTGGCT GACATATTCC CTTCACTGAA GTTTCTTCAT GTGCTCAGTG 
721 GAATGAAGGG TAAGATTATG AATGCACACC ATAAGGTAGA TGCCATTGTT GAGAATGTCA 
781 TCAATGAGCA CAAGAAAAAT CTTGCAATTG GGAAAACTAA TGGAGCGTTA GGAGGTGAAG 
841 ATTTAATTGA TGTTCTTCTA AGACTTATGA ATGATGGAGG CCTTCAATTT CCTATCACCA 
901 ACGACAACAT CAAAGCCATA ATTTTTGACA TGTTTGCTGC CGGGACAGAG ACTTCATCGT 
961 CAACAATTGT GTGGGCTATG GTAGAAATGG TGAAAAATCC AGCCGTATTC GCGAAAGCTC 
1021 AAGCAGAAGT AAGAGAAGCA TTTAGAGGAA AAGAAACTTT CGATGAAAAT GATGTGGAGG 
1081 AGCTAAACTA CCTAAAGTTA GTAATAAAAG AAACTCTAAG ACTTCATCCA CCGGTTCCAC 
1141 TTTTGCTCCC AAGAGAATGT AGGGAAGAGA CAAATATAAA CGGCTACACT ATTCCTGTAA 
1201 AGACCAAAGT CATGGTTAAT GTTTGGGCTT TGGGAAGAGA TCCAAAATAT TGGAATGACG 
1261 CAGAAACTTT TATGCCAGAG AGATTTGAGC AGTGCTCTAA GGATTTTGTT GGTAATAATT 
1321 TTGAATATCT TCCATTTGGT GGCGGAAGGA GGATTTGTCC TGGGATTTCG TTTGGCTTAG 
1381 CTAATGCTTA TTTGCCATTG GCTCAATTAC TATATCACTT CGATTGGAAA CTCCCTGCTG 
1441 GAATCGAACC AAGCGACTTG GACTTGACTG AGTTGGTTGG AGTAACTGCC GCTAGAAAAA 
1501 GTGACCTTTA CTTGGTTGCG ACTCCTTATC AACCTCCTCA AAAGTGATTT AATGGTTTCA 
1561 AGTTTTTATT TCCTAGCAAA CCCCACTATT GTCCTATCTT TCTTTTGGTG TTTTCGGTTT 
1621 TATCTACTCT AATACATGCA TCTTTTACCA TATAGGAATG TACCATGTTG TCG 



1 MQLRFEEYQL TKMQFFSLVS IFLFLSFLFL LRIWKNSNSQ SKKLPPGPWK LPILGSMLHM 
61 VGGLPHHVLR DLAKKYGPLM HLQLGEVSAV WTSPDTAKE VLKTHDIAFA SRPSLLAPEI 
121 VCYNRSDLAF CPYGDYWRQM RKICVLEVLS AKNVRTFSSI RRNEVLRLIN FIRSSSGEPI 
181 NVTERIFLFT SSMTCRSAFG QVFKEQDKFI QLIKEVILLA GGFDVADIFP SLKFLHVLSG 
241 MKGKIMNAHH KVDAIVENVI NEHKKNLAIG KTNGALGGED LIDVLLRLMN DGGLQFPITN 
301 DNIKAIIFDM FAAGTETSSS TIVWAMVEMV KNPAVFAKAQ AEVREAFRGK ETFDENDVEE 
361 LNYLKLVIKE TLRLHPPVPL LLPRECREET NINGYTIPVK TKVMVNWAL GRDPKYWNDA 
421 ETFMPERFEQ CSKDFVGNNF EYLPFGGGRR ICPGISFGLA NAYLPLAQLL YHFDWKLPAG 
481 IEPSDLDLTE LVGVTAARKS DLYLVATPYQ PPQK 



FIG. 107 



NAME D209-AH10 
ORGANISM N I COT I ANA TABACUM 

SEQ. ID. N0^213^ ct ^^j^opj, GAATACCAAC TAACCAAAGT GCAGTTCTTC AGCTTGGTTT 
61 CCATTTTCCT ATTTCTATCT TTCCTCTTTT TGTTAAGGAT ATGGAAGAAC TCCAATAGCC 
121 AAAGCAAAAA GTTGCCACCA GGTCCATGGA AACTACCAAT ACTAGGAAGT ATGCTTCATA 
181 TGGTTGGTGG ACTACCACAC CATGTCCTTA GAGATTTAGC CAAAAAATAT GGACCACTTA 
241 TGCACCTTCA ATTAGGTGAA GTTTCTGCGG TTGTGGTTAC TTCTCCTGAT ACGGCAAAAG 
301 AAGTATTAAA AACTCATGAC ATCGCTTTTG CGTCTAGGCC TAGCCTTTTG GCCCCGGAGA 
361 TTGTCTGTTA CAATAGGTCT GATCTAGCCT TTTGCCCCTA TGGCGACTAT TGGAGACAAA 
421 TGCGTAAAAT ATGTGTCTTG GAAGTGCTCA GTGCCAAGAA TGTTCGGACA TTTAGCTCTA 
481 TTAGGCGGAA TGAAGTTCTT CGTCTCATTA ATTTTATCCG GTCATCTTCT GGTGAACCTA 
541 TTAATGTTAC GGAAAGGATC TTTTTGTTCA CAAGCTCCAT GACATGTAGA TCAGCGTTTG 
601 GGCAAGTGTT CAAAGAGCAA GACAAATTTA TACAACTAAT TAAAGAAGTG ATACTCTTAG 
661 CAGGAGGGTT TGATGTGGCT GACATATTCC CTTCACTGAA GTTTCTTCAT GTGCTCAGTG 
721 GAATGAAGGG TAAGATTATG AATGCACACC ATAAGGTAGA TGCCATTGTT GAGAATGTCA 
781 TCAATGAGCA CAAGAAAAAT CTTGCAATTG GGAAAACTAA TGGAGCGTTA GGAGGTGAAG 
841 ATTTAATTGA TGTTCCTCTA AGACTTATGA ATGATGGAGG CCTTCAATTT CCTATCACCA 
901 ACGACAACAT CAAAGCTATA ATTTTTGACA TGTTTGCTGC CGGGACGGAG ACTTCATCGT 
961 CAACAATTGT GTGGGCTATG GTAGAAATGG TGAAAAATCC AGCCGTATTC GCGAAAGCTC 
1021 AAGCAGAAGT AAGAGAAGCA TTTAGAGGAA AAGAAACTTT CGATGAAAAT GATGTGGAGG 
1081 AGCTAAACTA CCTAAAGTTA GTAATAAAAG AAACTCTAAG ACTTCATCCA CCGGTTCCAC 
1141 TTTTGCTCCC AAGAGAATGT AGGGAAGAGA CAAATATAAA CGGCTACACT ATTCCTGTAA 
1201 AGACCAAAGT CATGGTTAAT GTTTGGGCTT TGGGAAGAGA TCCAAAATAT TGGAATGACG 
1261 CAGAAACTTT TATGCCAGAG AGATTTGAGC AGTGCTCTAA GGATTTTGTT GGTAATAATT 
1321 TTGAATATCT TCCATTTGGT GGCGGAAGGA GGATTTGTCC TGGGATTTCG TTTGGCTTAG 
1381 CTAATGCTTA TTTGCCATTG GCTCAATTAC TATATCACTT CGATTGGAAA CTCCCTGCTG 
1441 GAATCGAACC AAGCGACTTG GACTTGACTG AGTTGGTTGG AGTAACTGCC GCTAGAAAAA 
1501 GTGACCTTTA CTTGGTTGCG ACTCCTTATC AACCTCCTCA AAAGTGATTT AATGGTTTCA 
1561 AGTTTTTATT TCCTAGCAAA CCCCACTATT GTCCTATCTT TCTTTTGGTG TTTTCGGTTT 
1621 TATCTACTCT AATACATGCA TCTTTTACCA TATAGGAATG TACCAT GTT G TCG 

SEQ. ID. NO. 214 ^ TKVQFFSLVS IFLFLSFLFL LRIWKNSNSQ SKKLPPGPWK LPILGSMLHM 
61 VGGLPHHVLR DLAKKYGPLM HLQLGEVSAV WTSPDTAKE VLKTHDIAFA SRPSLLAPEI 
121 VCYNRSDLAF CPYGDYWRQM RKICVLEVLS AKNVRTFSSI RRNEVLRLIN FIRSSSGEPI 
181 NVTERIFLFT SSMTCRSAFG QVFKEQDKFI QLIKEVILLA GGFDVADIFP SLKFLHVLSG 
241 MKGKIMNAHH KVDAIVENVI NEHKKNLAIG KTNGALGGED LIDVPLRLMN DGGLQFPITN 
301 DNIKAIIFDM FAAGTETSSS TIWAMVEMV KNPAVFAKAQ AEVREAFRGK ETFDENDVEE 
361 LNYLKLVIKE TLRLHPPVPL LLPRECREET NINGYTIPVK TKVMVNWAL GRDPKYWNDA 
421 ETFMPERFEQ CSKDFVGNNF EYLPFGGGRR ICPGISFGLA NAYLPLAQLL YHFDWKLPAG 
481 IEPSDLDLTE LVGVTAARKS DLYLVATPYQ PPQK 



FIG. 108 



NAME D87A-AF3 
ORGANISM N I COT I ANA TABACUM 

SEQ ' l^GAAATGGGAA ATGCTCACAA CAGCAAAATT GCAGCAATCT GTTTGATAAT TTTCTTGGTA 

61 TATAAAGCAT GGGAATTGTT GAAGTGGATA TGGATTAAGC CAAAGAAACT GGAGAGTTGC 
121 CTCAGAAAAC AGGGACTCAA AGGAAATTCC TACAGGCTAT TCTATGGAGA TATGAAAGAA 
181 TTGTCCAAAA GTCTCAAGGA AATCAATTCA AAGCCCATCA TCAATCTATC AAATGAAGTA 
241 GCCCCAAGAA TCATTCCTTA TTATCTTGAA ATCATCCAAA AATATGGTAA AAGATGTTTT 
301 GTTTGGCAAG GACCAACCCC CGCAATATTA ATAACAGAGC CAGAATTAAT AAAGGAGATA 
361 TTTGGTAAGA ACTATGTTTT TCAGAAGCCT AATAATCCCA ACCCACTGAC CAAGTTATTG 
421 GCTCGAGGTG TTGTAAGCTA CGAGGAAGAA AAATGGGCAA AACACAGAAA GATCTTAAAC 
481 CCTGCCTTTC AT AT GGAGAA GTTGAAGCAT ATGCTACCAG CATTTTACTT GAGCTGTAGT 
541 GAGATGCTGA ACAAATGGGA GGAGATTATC CCAGTAAAAG AATCAAATGA GTTGGACATT 
601 TGGCCTCATC TTCAAAGAAT GACAAGTGAT GTGATTTCTC GTGCTGCCTT TGGTAGTAGC 
661 TACGAAGAAG GAAGAAGAAT ATTTGAACTT CAAGAAGAAC AAGCTGAGTA TCTAACGAAG 
721 ACATTCAATT CAGTTTATAT CCCAGGTTCC AGATTTTTTC CCAATAAAAT GAACAAAAGA 
781 ATGAAAGAAT GTGAAAAGGA AGTACGAGAA ACAATTACGT GTCTAATTGA CAACAGATTA 
841 AAGGCAAAAG AAGAAGGCAA TGGCAAGGCC CTCAATGATG ACCTACTGGG TATATTATTA 
901 GAGTCAAATT CTATAGAAAT TGAAGAACAT GGTAACAAGA AGTTTGGAAT GAGTATACCT 
961 GAAGTAATTG AAGAGT GCAA ATTATTCTAT TTTGCTGGCC AAGAGACTAC ATCAGTATTG 
1021 CTTGTGTGGA CACTGATTTT GTTAGGGAGA AATCCAGAAT GGCAGGAACG TGCTAGAGAG 
1081 GAAGTTTTTC AAGCCTTTGG AAGTGATAAA CCAACTTTTG ACGAATTATA TCGCTTGAAA 
1141 ATTGTGACGA TGATTTTGTA CGAGTCTTTA AGGTTATATC CACCAATAGC AACTCGTACT 
1201 CGAAGGACTA ATGAAGAAAC AAAATTAGGG GAACTAGATT TACCAAAGGG TGCACTGCTC 
1261 TTTATACCAA CAATCTTATT ACATCTTGAC AAGGAAATTT GGGGTGAAGA TGCAGATGAG 
1321 TTCAATCCGG AGAGATTTAG CGAAGGGGTG GCAAAGGCAA CAAAGGGGAA AATGACATAT 
1381 TTTCCATTTG GTGCAGGACC GCGAAAATGC ATTGGGCAAA ACTTCGCGAT TTTGGAAGCA 
1441 AAAATGGCTA TAGCTATGAT TCTACAACGC TTCTCCTTCG AGCTCTCTCC ATCTTATACA 
1501 CACTCTCCAT ACACTGTGGT CACTTTGAAA CCCAAATATG GTGCTCCCCT AATAATGCAC 
1561 AGGCTGTAGT CCTGTGAGAA 

SEQ. ID ' N ^H^SKIA AICLIIFLVY KAWELLKWIW IKPKKLESCL RKQGLKGNSY RLFYGDMKEL 
61 SKSLKEINSK PIINLSNEVA PRIIPYYLEI IQKYGKRCFV WQGPTPAILI TEPELIKEIF 
121 GKNYVFQKPN NPNPLTKLLA RGWSYEEEK WAKHRKILNP AFHMEKLKHM LPAFYLSCSE 
181 MLNKWEEIIP VKESNELDIW PHLQRMTSDV ISRAAFGSSY EEGRRIFELQ EEQAEYLTKT 
241 FNSVYIPGSR FFPNKMNKRM KECEKEVRET ITCLIDNRLK AKEEGNGKAL NDDLLGILLE 
301 SNSIEIEEHG NKKFGMSIPE VIEECKLFYF AGQETTSVLL VWTLILLGRN PEWQERAREE 
361 VFQAFGSDKP TFDELYRLKI VTMILYESLR LYPPIATRTR RTNEETKLGE LDLPKGALLF 
421 IPTILLHLDK EIWGEDADEF NPERFSEGVA KATKGKMTYF PFGAGPRKCI GONFAILEAK 
481 MAIAMILQRF SFELSPSYTH SPYTWTLKP KYGAPLIMHR L 
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NAME D208-AC8 
ORGANISM NICOTIANA TABACUM 

SEQ. ID. m. HI c ccATAGAAGC CTTTGTAGGA CTAGTAACCT TCACATTTCT CTTATACTTC 
61 CTATGGACAA AAAAATCTCA AAAACTTCCA AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
121 TGGCCGGTAA TCGGCCATCT TTTTCACTTC AATAACGACG GCGACGACCG TCCATTAGCT 
111 CGAAAGCTCG GAGACTTAGC TGATAAATAC GGCCCCGTTT TCACTTTTCG GCTAGGTCTT 
241 CC^TGTGC TAGTT GTAAG CAGTTACGAA GCTATAAAAG ATTGCTTCTC T ACAAAT GAT 
111 GCCATTTTCT CCAATCGTCC AGCTCTTCTT TACGGCGAAT ACCTTGGCTA CAATAATACA 
361 ATGCTTTTTC TAGCAAATTA CGGACCTTAC TGGCGAAAAA ATCGTAAATT AGTCATTCAG 

421 Stctct ctgctagtcg tctcgaaaaa ttcaaacaag tgagattcac cagaattcaa 

Ifll ATGAGCATTA AGAATTTATA CACTCGAATT AATGGAAATT CGAGTACGAT AAATCTAACT 
111 S« TTTTGGTCTG ATCGTGAAAA TGATCGCTGG GAAAAATTAT 

601 GAATCCGGTA AAGGAGATGA ACAAGTGGAA AGATTTAAGA ATGCGTTTAA GGATTTTATG 
111 GTTTTATCAA TGGAATTTGT ATTATGGGAT GCATTTCCAA TTCCATTATT TAAATGGGTG 
ill GATTTTCAAG GTCATATTAA GGCAATGAAA AGGACATTTA AGGATATAGA TTCTGTTTTT 
ill TAGAGGAACA TATTAATAAA AGAGAAAAAA TAGAGGTTGG TGCAGAAGGG 

Ml AATGAACAAG ATTTCATTGA TGTGGTGCTT TCAAAATTGA GTAAAGAATA TCTTGATGAA 
HI ^TTACTCTC GTGATACTGT CATTAAAGCA ACAGTTTTTA GTTTGGTCTT GGATGCAGCA 
111 S CTCTTCACAT AAATTGGGGA ATGACATTAT TGATAAACAA TCAAAATGCC 
1021 SSaaIg cacaagaaga GATAGACACA AAAGTTGGTA AGGATAGATG ggtagaagag 
1081 AGTGATATTA aggatttagt atacctccaa gctattgtta aaaaggtgtt acgattatat 
HA cSc^ggIc ctttgttagt accacatgaa aatgtaaagg attgtgttgt tagtggatat 
S cSattccta aagggactag attattcgca aacgtcatga aactgcagcg cgatcctaaa 
],,] ctct^tcAA atcctgataa gttcgatcca gagagattca tcgctggtga tattgacttc 
)\t\ CGTGGTCACC actatgagtt tatcccattt ggttctggaa gacgatcttg tccggggatg 

\lll ACTTATGCAT SS ACACCTAACA ATGGCACATT TAATCCAGGG TTTCAATTAC 
ITA AAAACTCCAA ATGACGAGGC CTTGGATATG AAGGAAGGTG CAGGCATAAC AATACGTAAG 

itoi Stcca^ tggaattgat aataacgcct cgcttggcac ctgagcttta ctaaaaccta 
1561 AGATGTTTCA tcttggttga tcattgt 

SEQ. ID. N0^218 LVTFTFLLYF LWTKKSQKLP KPLPPKIPGG WPVIGHLFHF NNDGDDRPLA 

61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE AIKDCFSTND AI FSNRPALL YGEYLGYNNT 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK FKQVRFTRIQ TSIKNLYTRI NGNSSTINLT 

8 S iXagkny ESGKGDEQVE rfknafkdfm vlsmefvlwd afpiplfkwv 

241 DFOGHIKAMK RTFKDIDSVF QNWLEEHINK REKIEVGAEG NEQDFIDWL SKLSKEYLDE 
301 GYSRDTVIKA TVFSLVLDAA DTVALHINWG MTLLINNQNA LMKAQEEIDT KVGKDRWVEE 

111 sdikSlvylq aivkkvlrly ppgpllvphe nvkdcwsgy hipkgtrlfa nvmklqrdpk 
421 llsnpdkfdp erfiagdidf rghhyefipf gsgrrscpgm tyalqvehlt mahliqgfny 
481 ktpndealdm kegagitirk vnpveliitp rlapely 
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NAME D215-ABS 
ORGANISM NICOTIANA TABACUM 

SEQ. ID. "° G CTTCAATATG GAGATACCAT ATTACAGCTT AAAAATTGCA ATTTCTTCAT 

61 TTGCAATTAT CTTTGTACTA AGATGGGCAT GGAAAATCTT GAATTATGTG TGGTTAAAAC 

dl caaaaS ggagaaatac ctcagacagc agggtttcaa aggaaactct tacaaattct 

181 TGTTTGGGGA TATGAAAGAG ACGAAGAAAA TGGGTGAAGA AGCTATGTCT AAGCCAATCA 
241 ATTTCTCTCA TGACATGATT TGGCCTAGAG TTATGCCATT CATCCACAAA ACCATCACCA 
301 ATTATGGTAA GAATTGTATT GTGTGGTTTG GGCCAAGACC AGCAGTCCTG ATCACAGACC 
111 CGGAACTTGT AAAGGAGGTG CTAACGAAGA ATTTCGTCTA TCAGAAGCCG CTTGGCAATC 
471 CACTCACAAA GTTGGCAGCA ACTGGAATTG CAGGCTATGA AACAGATAAA TGGGCTACAC 
481 ATAGAAGGCT TCTCAATCCT GCTTTTCACC TTGACAAGTT GAAGCATATG CTACCTGCAT 
TCCAA^T^AC TGCTAGTGAG ATGTTGAGCA AATTGGAGAA AGTTGTTTCA CCAAACGGAA 
601 CAGAGATAGA TGTGTGGCCA TATTTACAAA CTTTGACAAG TGATGCCATT TCAAGAACTG 
661 SSSggaag TAGTTATGAA GAAGGAAGAA AGATTTTTGA CCTTCAAAAA GAACAACTTT 
721 CACTAATTCT AGAAGTTTCA CGCACAATAT ATATTCCAGG ATGGAGGTTT TTGCCAACGA 
781 AAAGGAACAA AAGGATGAAG CAAATATTTA ATGAAGTACG AGCACTGGTA TTTGGAATTA 
841 TTAAGAAAAG GATGAGTATG ATTGAAAATG GAGAAGCACC TGATGATTTA TTGGGAATAT 
901 TATTGGCATC CAATTTAAAA GAAATCCAAC AACATGGAAA CAACAAGAAA TTTGGTATGA 
111 GTMTGATGA GGTGATTGAA GAGT GTAAAC TCTTCTATTT TGCTGGGCAA GAGACTACTT 
1021 CATCTTTACT TGTATGGACT ATGATTTTGT TGTGCAAATA TCCTAATTGG CAAGATAAAG 
1081 CTAGAGAAGA GGTTTTGCAA GTGTTTGGGA GTAGGGAAGT TGACTATGAC AAGTTGAATC 
u" AGCTAAAAAT AGTAACTATG ATCTTAAACG AGGTCTTAAG GTTGTATCCA GCAGGATATG 
All TGATTAATCG AATGGTAAAC AAAGAAACAA AGTTAGGGAA TTTGTGTTTA CCAGCCGGCG 
1261 TACAGCTCGT GTTACCAACA ATGTTGTTGC AACATGATAC TGAAATATGG GGAGATGATG 
1321 CAATGGAGTT CAATCCAGAG AGATTTAGTG ATGGAATATC CAAAGCAACA AAAGGAAAAC 
\H\ ttgTGTTTTT TCCATTTAGT TGGGGTCCAA GAATATGTAT TGGGCAAAAT TTTGCTATGT 
liil ?Igaggctaa AATGGCAATG GCTATGATTC TGAAAACCTA TGCATTTGAA ctctctccat 
1501 ™gctca tgctcctcat ccactactac ttcaacctca atatggtgct caattaattt 

1561 TGTACAAGTT GTAGATATGG TCAATCTGGA ACTTGTTATG GAACTTTTAT CATCGTAATC 
1621 AACCATATTG AGGG 

SEQ. ID. NO 220 m LRWAWKILNY VWLKPKELEK YLRQQGFKGN SYKFLFGDMK 

61 ETKKMGEEAM SKPINFSHDM IWPRVMPFIH KTITNYGKNC IVWFGPRPAV "TDPELVKE 
121 VLTKNFVYQK PLGNPLTKLA ATGIAGYETD KWATHRRLLN PAFHLDKLKH MLPAFQFTAS 

ill emlsklekW spngteidw pylqtltsda isrtafgssy eegrkifdlq keqlslilev 

241 SRTIYIPGWR FLPTKRNKRM KQIFNEVRAL VFGIIKKRMS MIENGEAPDD LLGILLASNL 
301 KEIQQHGNNK KFGMSIDEVI EECKLFYFAG QETTS SLLVW TMILLCKYPN WQDKAREEVL 
361 QVFGSREVDY DKLNQLKIVT MILNEVLRLY PAGYVINRMV NKETKLGNLC LPAGVQLVLP 
421 TMLLQHDTEI WGDDAMEFNP ERFSDGISKA TKGKLVFFPF SWGPRICIGQ NFAMLEAKMA 
481 MAMILKTYAF ELSPSYAHAP HPLLLQPQYG AQLILYKL 
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NAME D103-AH3 
ORGANISM NICOTIANA TABACUM 

SEQ. ID. NO 221 Gc QTTTGTAGGA CTAGTAACCT TCACATTTCT CTTATACTTC 

6i ctatggacaI a^aatctca aaaacttcca aaacccttac caccgaaaat ccccggagga 

" Srrr^CTAA TCGGCCACCT TTTTCACTTC AATAACGACG GCGACGACCG TCCATTAGCT 
m cga1S£ S^AGC TGATAAATAC GGCCCCGTTT tcacttttcg gctaggtctt 
III Cctc^GTGC TAGTTGTAAG CAGTTACGAA GCTACAAAAG ATTGCTTCTC TACAAATGAC 
lm rcCATTTTCT CCAATCGTCC AGCTTTTCTT TACGGCGAAT ACCTTGGCTA CAATAATACA 
361 mgc"tttttc TA^TTA CGGACCTTAC TGGCGAAAAA atcgtaaatt agtcattcag 
lo\ rlaTTTCTCT CTGCTAGTCG TCTCGAAAAA TTCAAACAAG TGAGATTCAC CAGAATTCAA 

SStS Sttata cactcgaatt aatggaaatt CGAGTACGAT aaatctaact 
s.i ^gcttIg a^gaattgaa ttttggtctg atcgtgaaaa tgatcgctgg gaaaaattat 
loi gaItccggta aaggagatga acaagtggaa agatttaaga atgcgtttaa ggattttatg 
601 ^ttatcIa" £gaatttgt attatgggat gcatttccaa ttccattatt taaatgggtg 

721 



i61 = = = 



SS » £££££ Xga^aaaa' tggaggttgg tgcagaaggg 

84^ AATGAACAAG It^CA^TGA TGTGGTGCTT TCAAAATTGA GTAAAGAATA TCTTGATGAA 
Qni GGTTACTCTC GTGATACTGT CATTAAAGCA ACAGTTTTTA GTTTGGTCTT GGATGCAGCA 
SI GACAGAGTTG CT^TCACAT AAATTGGGGA ATGACATTAT TG.TAAACAA TCAAAATGCC 



iltl ^Sg S££a ZZSSZ a^ttggta aggatagatg ggtagaagag 
\oli aStatta aggatttagt atacctccaa gctattgtta aaaaggtgtt acgattatat 

1081 "c CTTTGTTAGT ACCACATGAA AATGTAAAGG ATTGTGTTGT TAGTGGATAT 
CACATTCCTA AAGGGACTAG ATTATTCGCA AACGTCATGA AACTGCAGCG CGATCCTAAA 

cSt C Ja Stcctgataa gttcgatcca gagagattca tcgctggtga tattgact c 



lltl CA^™a « AACGTCATGA AACTGCAGCG CGATCCTAAA 

1761 CT CTT GT CAA ATCCTGATAA GTTCGATCCA GAGAGATTCA TCGCTGGTGA TATTGACTTC 
CGTGGTCACC ACTAtS TATCCCATCT ™GGAA ^CTTG ^CGGGGATG 
ACTTATGCAT TGCAAGTGGA ACACCTAACA ATGGCACATT TAATCCAGGG TTTCAATTAC 
A^ArTCCAA ATGACGAGGT CTTGGATATG AAGGAAGGTG CAGGCATAAC AATACGTAAG 
isol Stcc^g tg^gat Staacgcct CGCTTGGCAC CTGAGCTTTA CTAAAACCTA 



SEQ. ID. NO. 222 LVTFTFLLYF lwtkksqklp kplppkipgg wpvighlfhf nndgddrpla 

61 rklgdladky gpvftfrlgl plvlwssye atkdcfstnd aifsnrpafl vgeylgynnt 
121 SlfSpy wrknrklviq evlsasrlek fkqvrftriq tsiknlytri ngnsstinlt 
ill dwXnfgl iZiagkny esgkgdeqve rfknafkdfm ^smefvlwd afpi™ 
nFOGHIKTMK rtfkdidsvf qnwleehink rekmevgaeg neqdfidwl sklskeylde 
gysrS™ tvfslvldaa dtvalhinwg mtllinnqna lmkaqeeidt kvgkdrwvee 
sdikdlww wwkvlrly ppgpllvphe nvkdcwsgy hipkgtrlfa nvmklqrdpk 

SS ERFIAGDIDF RGHHYEFIPS GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 



241 
301 
361 
421 



4Z± iiljOHtunt uc j.™-. 

481 KTPNDEVLDM KEGAGITIRK VNPVELIITP RLAPELY 
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NAME D208-AD9 
ORGANISM NICOTIANA TABACUM 

SEQ. ID. I™ c c CATAGAAGC CATTGTAGGA CTAGTAACCT TCACATTTCT CTTCTTCTTC 

61 CTATGGACAA AAAAATCTCA AAAACCTTCA AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
121 TGGCCGGTAA TCGGCCATCT TTTCCACTTC AATGACGACG GCGACGACCG TCCATTAGCT 
181 CGAAAACTCG GAGACTTAGC TGACAAATAC GGCCCCGTTT TCACTTTTCG GCTAGGCCTT 
241 CCCCTTGTCT TAGTTGTAAG CAGTTACGAA GCTGTAAAAG ACTGTTTCTC CACAAATGAC 
301 GCCATTTTTT CCAATCGTCC AGCTTTTCTT TACGGCGATT ACCTTGGCTA CAATAATGCC 
361 ATGCTATTTT TGGCCAATTA CGGACCTTAC TGGCGAAAAA ATCGAAAATT AGTTATTCAG 
421 GAAGTTCTCT CCGCTAGTCG TCTCGAAAAA TTCAAACACG TGAGATTTGC AAGAATTCAA 
481 GCGAGCATGA AGAATTTATA TACTCGAATT GATGGAAATT CGAGTACGAT AAATTTAACT 
541 GATTGGTTAG AAGAATTGAA TTTTGGTCTG ATCGTGAAGA TGATCGCTGG AAAAAATTAT 
601 GAATCCGGTA AAGGAGATGA ACAAGTGGAG AGATTTAAGA AAGCGTTTAA GGATTTTATG 
661 ATTTTATCAA TGGAGTTTGT GTTATGGGAT GCATTTCCAA TTCCATTATT TAAATGGGTG 
721 GATTTTCAAG GGCATGTTAA GGCTATGAAA AGGACTTTTA AAGATATAGA TTCTGTTTTT 
781 CAGAATTGGT TAGAGGAACA TATTAATAAA AGAGAAAAAA TGGAGGTTAA TGCAGAAGGG 
841 AATGAACAAG ATTTCATTGA TGTGGTGCTT TCAAAAATGA GTAATGAATA TCTTGGTGAA 
901 GGTTACTCTC GTGATACTGT CATTGAAGCA ACGGTGTTTA GTTTGGTCTT GGATGCAGCA 
961 GACACAGTTG CTCTTCACAT AAATTGGGGA ATGGCATTAT TGATAAACAA TCAAAAGGCC 
1021 TTGACGAAAG CACAAGAAGA GATAGACACA AAAGTTTGTA AGGACAGATG GGTAGAAGAG 
1081 AGTGATATTA AGGATTTGGT ATACCTCCAA GCTATTGTTA AAGAAGTGTT ACGATTATAT 
1141 CCACCAGGAC CTTTGTTAGT ACCACACGAA AATGTAGAAG ATTGTGTTGT TAGTGGATAT 
1201 CACATTCCTA AAGGGACAAG ATTATTCGCA AACGTCATGA AACTGCAACG TGATCCTAAA 
1261 CTCTGGTCTG ATCCTGATAC TTTCGATCCA GAGAGATTCA TTGCTACTGA TATTGACTTT 
1321 CGTGGTCAGT ACTATAAGTA TATCCCGTTT GGTCCTGGAA GACGATCTTG TCCAGGGATG 

13 81 ACTTATGCAT TGCAAGTGGA ACACTTAACA ATGGCACATT TGATCCAAGG TTTCAATTAC 

14 41 AGAACTCCAA ATGACGAGCC CTTGGATATG AAGGAAGGTG CAGGCATAAC TATACGTAAG 
1501 GTAAATCCTG TGGAACTGAT AATAGCGCCT CGCCTGGCAC CTGAGCTTTA TTAAAACCTA 
1561 AGATGTTTCA TCTTGGTTGA 

SEQ. ID. ™^J24 g LVTFTFLFFF LWTKKSQKPS KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK FKHVRFARIQ ASMKNLYTRI DGNSSTINLT 
181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 
241 DFOGHVKAMK RTFKDIDSVF QNWLEEHINK REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
301 GYSRDTVIEA TVFSLVLDAA DTVALHINWG MALLINNQKA LTKAQEEIDT KVCKDRWVEE 
361 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE NVEDCWSGY HIPKGTRLFA NVMKLQRDPK 
421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF GPGRRSCPGM TYALQVEHLT MAHLIQGFNY 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP RLAPELY 
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NAME D237-AD1 
ORGANISM N I COT I AN A TABACUM 

SEQ. ID. ^ T ^ TATAC ctTTAGTACT CTTGAAATTT TCAAATAATG GTTTATCTTC TTTCTCCCAT 
61 AGAAGCCATT GTAGGATTTG TAACCTTTTC ATTTCTATTC TACTTTCTAT GGACCAAAAA 
121 ACAATCAAAA ATCTTAAACC CACTACCTCC AAAAATCCCA GGTGGATGGC CAGTAATCGG 
181 CCATCTCTTT TATTTCAAGA ACAATGGCGA TGAAGATCGC CATTTTTCTC AAAAACTCGG 
241 TGACTTAGCT GACAAATATG GTCCCGTCTT CACTTTCCGG TTAGGGTTTC GCCGTTTCTT 
301 GGCGGTGAGT AGTTATGAAG CTAT GAAAGA ATGCTTCACT ACCAATGATA TCCATTTCGC 
361 CGATCGGCCA TCTTTACTCT ACGGAGAATA CCTTTGCTAT AATAACGCCA TGCTTGCTGT 
421 TGCCAAATAT GGCCCTTACT GGAAAAAAAA TCGAAAGTTA GTCAATCAAG AAGTTCTCTC 
481 CGTTAGTCGG CTCGAAAAAT TCAAACATGT TAGATTTTCT ATAATTCAGA AAAATATTAA 
541 ACAATTGTAT AATTGTGATT CACCAATGGT GAAGATAAAC CTTAGTGATT GGATAGATAA 
601 ATTGACATTC GACATCATTT TGAAAATGGT TGTTGGGAAG AACTATAATA AT GGACAT GG 
661 AGAAATACTC AAAGTTGCTT TTCAGAAATT CATGGTTCAA GCTATGGAGA TGGAGCTCTA 
721 TGATGTTTTT CACATTCCAT TTTTCAAGTG GTTGGATCTT ACAGGGAATA TTAAGGCTAT 
781 GAAACAAACT TTCAAAGACA TTGATAATAT TATCCAAGGT TGGTTAGATG AGCACATTAA 
841 GAAGAGAGAA ACAAAGGATG TTGGAGGTGA AAACGAACAA GATTTTATAG ATGTGGTGCT 
901 TTCCAAGATG AGCGACGAAC ATCTTGGCGA GGGTTACTCT CATGACACAA CCATCAAAGC 
961 AACTGTATTC ACTTTGGTCT TGGATGCAAC AGACACACTT GCACTTCATA TAAAGTGGGT 
1021 AATGGCGTTA ATGATAAACA ATAAGCATGT CATGAAGAAA GCACAAGAAG AGATGGACAC 
1081 AATTGTTGGT AGAGATAGAT GGGTAGAAGA GAGTGATATC AAGAATTTGG TGTATCTCCA 
1141 AGCAATTGTC AAAGAAGTAT TACGATTACA TCCACCCGCA CCTTTGTCAG TGCAACACCT 
1201 AT CTGTAGAA GATTGTGTTG TCAATGGGTA CCATATTCCT AAGGGGACTG CACTACTTAC 
1261 CAATATTATG AAACTACAGC GAGATCCTCA AACATGGCCA AATCCTGATA AATTCGATCC 
1321 AGAGAGATTC CTGACGACTC ATGCTACTAT TGACTACCGC GGGCAGCACT ATGAGTCGAT 
1381 CCCCTTTGGT ACGGGGAGAC GAGCTTGTCC CGCGATGAAT TATTCATTGC AAGTGGAACA 
1441 CCTTTCAATT GCTCATATGA TCCAAGGTTT CAGTTTTGCA ACTACGACCA ATGAGCCTTT 
1501 GGATATGAAA CAAGGTGTGG GTTTAACTTT ACCAAAGAAG ACTGATGTTG AAGTGCTAAT 
1561 TACACCTCGC CTTCCTCCTA CGCTTTATCA ATATTAAGAT GTTTTGTTGT CGGGATTCGT 
1621 TCTGATCAAT CCCTCAATG 



1 MVYLLSPIEA IVGFVTFSFL FYFLWTKKQS KILNPLPPKI PGGWPVIGHL FYFKNNGDED 
61 RHFSQKLGDL ADKYGPVFTF RLGFRRFLAV SSYEAMKECF TTNDIHFADR PSLLYGEYLC 
121 YNNAMLAVAK YGPYWKKNRK LVNQEVLSVS RLEKFKHVRF SIIQKNIKQL YNCDSPMVKI 
181 NLSDWIDKLT FDIILKMWG KNYNNGHGEI LKVAFQKFMV QAMEMELYDV FHIPFFKWLD 
241 LTGNIKAMKQ TFKDIDNIIQ GWLDEHIKKR ETKDVGGENE QDFIDWLSK MSDEHLGEGY 
301 SHDTTIKATV FTLVLDATDT LALHIKWVMA LMINNKHVMK KAQEEMDTIV GRDRWVEESD 
361 IKNLVYLQAI VKEVLRLHPP APLSVQHLSV EDCWNGYHI PKGTALLTNI MKLQRDPQTW 
421 PNPDKFDPER FLTTHATIDY RGQHYESIPF GTGRRACPAM NYSLQVEHLS IAHMIQGFSF 
481 ATTTNEPLDM KQGVGLTLPK KTDVEVLITP RLPPTLYQY 
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NAME D125-AF11 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 227 

1 CTTTTTCTCC CCAAAAAAGA GCTCATTTCC CTTGTCCCCA AAAATGGATC TTCTCTTACT 
61 AGAGAAGACC TTAATTGGTC TCTTCTTTGC CATTTTAATC GCTATAATTG TCTCTAGACT 
121 TCGTTCAAAG CGTTTTAAGC TTCCCCCAGG ACCAATCCCA GTACCAGTTT TTGGTAATTG 
181 GCTTCAAGTT GGTGATGATT TAAAC CACAG AAATCTTACT GATTTTGCCA AAAAATTTGG 
241 TGATCTTTTC TTGTTAAGAA TGGGCCAGCG TAATTTAGTT GTTGTGTCAT CTCCTGAATT 
301 AGCTAAAGAA GTTTTACACA CACAAGGTGT TGAATTTGGT TCAAGAACAA GAAATGTTGT 
361 ATTTGATATT TTTACTGGAA AAGGTCAAGA TATGGTTTTT ACTGTATATG GTGAACACTG 
421 GAGAAAAATG AGGAGAATTA TGACTGTACC ATTTTTTACT AATAAAGTTG T GCAGCAAT A 
481 TAGAGGGGGG TGGGAGTTTG AAGTGGCAAG TGTAATTGAG GATGTGAAGA AAAATCCTGA 
541 ATCTGCTACT AATGGGATTG TATTAAGGAG GAGATTACAA TTGATGATGT ATAATAATAT 
601 GTTTAGGATT ATGTTTGATA GGAGATTTGA GAGTGAAGAT GATCCTTTGT TTGTTAAGCT 
661 TAAGGCTTTG AATGGTGAAA GGAGTAGATT GGCTCAGAGT TTTGAGTATA ATTATGGTGA 
721 TTTTATTCCC ATTTTGAGGC CTTTTTTGAG AGGTTATTTG AAGATCTGTA AAGAAGTTAA 
781 GGAGAAGAGG CTGCAGCTTT TCAAAGATTA CTTTGTTGAT GAAAGAAAGA AGCTTTCAAA 
841 TACCAAGAGC TTGGACAGCA ATGCTCTGAA ATGTGCGATT GATCACATTC TTGAGGCTCA 
901 ACAGAAGGGG GAGATCAATG AGGACAACGT TCTTTACATT GTTGAAAACA TCAATGTTGC 
961 TGCTATAGAA ACCACATTAT GGTCAATTGA GTGGGGTATC GCCGAGTTAG TCAACCACCC 
1021 TCACATCCAA AAGAAACTCC GCGACGAGAT TGACACAGTT CTTGGCCCAG GAGT GCAAGT 
1081 GACTGAACCA GACACCCACA AGCTTCCATA CCTTCAGGCT GTGATCAAGG AGACGCTTCG 
1141 TCTCCGTATG GCAATTCCTC TATTAGTCCC ACACATGAAC CTTCACGATG CAAAGCTTGG 
1201 CGGGTTTGAT ATTCCAGCAG AGAGCAAAAT CTTGGTTAAC GCTTGGTGGC TAGCTAACAA 
1261 CCCGGCTCAT TGGAAGAAAC CCGAAGAGTT CAGACCCGAG AGGTTCTTCG AAGAGGAGAA 
1321 GCACGTTGAG GCCAAT GGCA ATGACTTCAG ATATCTTCCG TTTGGCGTTG GTAGGAGGAG 
1381 TTGCCCTGGA ATTATACTTG CATT GCCAAT TCTTGGCATT ACTTTGGGAC GTTTGGTTCA 
1441 GAACTTTGAG CTGTTGCCTC CTCCAGGCCA GTCGAAGCTC GACACCACAG AGAAAGGTGG 
1501 ACAGTTCAGT CTCCATATTT TGAAGCATTC CACCATTGTG TTGAAACCAA GGTCTTGCTG 
1561 AACTTTCTGA TCCTAATCAA TTAAGGGGTT GAAGAAATTT TATAATTATG 



i . In W • <L 6. O 

1 MDLLLLEKTL IGLFFAILIA IIVSRLRSKR FKLPPGPIPV PVFGNWLQVG DDLNHRNLTD 
61 FAKKFGDLFL LRMGQRNLW VSSPEIAKEV LHTQGVEFGS RTRNWFDIF TGKGQDMVFT 
121 VYGEHWRKMR RIMTVPFFTN KWQQYRGGW EFEVASVIED VKKNPESATN GIVLRRRLQL 
181 MMYNNMFRIM FDRRFESEDD PLFVKLKALN GERSRLAQSF EYNYGDFIPI LRPFLRGYLK 
241 ICKEVKEKRL QLFKDYFVDE RKKLSNTKSL DSNALKCAID HILEAQQKGE INEDNVLYIV 
301 ENINVAAIET TLWSIEWGIA ELVNHPHIQK KLRDEIDTVL GPGVQVTEPD THKLPYLQAV 
361 IKETLRLRMA IPLLVPHMNL HDAKLGGFDI PAESKI LVNA WWLANNPAHW KKPEEFRPER 
421 FFEEEKHVEA NGNDFRYLPF GVGRRSCPGI ILALPILGIT LGRLVQNFEL LPPPGQSKLD 
481 TTEKGGQFSL HILKHSTIVL KPRSC 
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NAME D134-AE11 
ORGANISM N I COT I ANA TABACUM 

SEQ. ID. ^°^^ TAAAA AXGGAGAC AT TATTTAACAT CAAAGTTGCA GTTTCATTAG TAATTGTGAT 
61 AATTTTTCTG AGATGGGTAT GGAAATTCTT GAATTGGGTG TGGATTCAAC CAAAGAAAAT 
121 GGAAAAAAGA CTAAAAATGG AAGGTTTCAA AGGAAGCTCA TATAAGCTAT TATTTGGAGA 
181 TATGAAAGAA ATAAATACAA TGGTTGAAGA AGCCAAAACC AAGCCTATGA ATTTTACCAA 
241 TGATTATGTG GCTAGAGTCT TGCCTCACTT CACAAAGTTG ATGCTCCAAT ATGGCAAGAA 
301 TAGCTTTATG TGGTTAGGGC CAAAACCAAC AATGTTTATC ACAGACCCTG AACTAATAAG 
361 GGAGATCTTG TCAAAAAGTT ACATATACCA GGAGATTCAA GGCAATCCAA TCACTAAGTT 
421 GCTAGCACAA GGACTAGTAA GTTATGAAGC AGAGAAATGG GCTAAGCATA GAAAAATTAT 
481 CAATCCTGCA TTTCACCTTG ACAAGTTGAA GCATATGCTA CCATCATTCT ACTTGAGTTG 
541 TTGTGACATG CTCAGAAAAT GGGAAAGTAT AGCTTCATCA GAGGGATCAG AAATAGACGT 
601 GTGGCCTTTT CTGGAAACGT TGACAAGCGA TGCTATTTCA AGAACAGCTT TTGGTAGTAA 
661 CTATGAAGAC GGGAGACAGA TATTTGAGCT TCAAAAAGAA CAAGCTGAGT TGATTTTACA 
721 AGCAGCGCGA TGGCTTTACA TCCCCGGATG GAGGTTTGTG CCAACAAAGA GGAACAAGAG 
781 GATGAAGCAA ATCGCTAAAG AAGTACGATC ATTAGTGTTG GGAATAATCA ATAAGAGAAT 
841 AAGGGAAATG AAAGCAGGGG AAGCTGCAAA AGATGACTTA CTGGGAATAC TATTGGAATC 
901 TAATTTCAAA GAAATCCAAA TGCACGGAAA CAAGAACTTT GGCATGACTA TCGACGAAGT 
961 GATTGAAGAG TGCAAGTTAT TTTACTTTGC TGGGCAAGAA ACTACTTCAG TTTTGCTTGT 
1021 TTGGACTTTG ATTTTACTGA GTAAGCATGT CGATTGGCAA GAAAGAGCTA GAGAAGAAGT 
1081 TCATCAAGTC TTTGGAAGTA ACAAACCTGA TTATGACGCA TTGAATCAGT TGAAAGTTGT 
1141 AACGATGATA TTCAACGAGG TTTTAAGGTT GTACCCACCG GGAATTACCA TAAGTCGAAC 
1201 TGTACACGAG GATACCAAAT TAGGGAACTT GTCATTGCCA GCAGGGATAC AGCTTGTGTT 
1261 ACCTGCAATT TGGTTGCATC ATGACAATGA AATATGGGGA GATGATGCAA AGGAGTTCAA 
1321 ACCAGAGAGG TTTAGTGAAG GAGTTAATAA AGCAACAAAG GGTAAATTTG CATATTTTCC 
1381 ATTTAGTTGG GGACCAAGAA TATGTGTTGG ACTGAATTTT GCAATGTTAG AGGCAAAAAT 
1441 GGCACTTGCA TTGATTCTAC AACACTATGC TTTTGAGCTC TCTCCATCTT ATGCACATGC 
1501 TCCTCATACA ATTATCACTC TGCAACCTCA ACATGGTGCT CCTTTGATTT TGCGCAAGCT 
1561 GTAGCGCGGA TATATTGATT GGTTATCTAC TGTAG 



1 METLFNIKVA VSLVIVIIFL RWVWKFLNWV WIQPKKMEKR LKMEGFKGSS YKLLFGDMKE 
61 INTMVEEAKT KPMNFTNDYV ARVLPHFTKL MLQYGKNSFM WLGPKPTMFI TDPELIREIL 
121 SKSYIYQEIQ GNPITKLLAQ GLVSYEAEKW AKHRKIINPA FHLDKLKHML PSFYLSCCDM 
181 LRKWESIASS EGSEIDVWPF LETLTSDAIS RTAFGSNYED GRQI FELQKE QAELILQAAR 
241 WLYIPGWRFV PTKRNKRMKQ IAKEVRSLVL GIINKRIREM KAGEAAKDDL LGILLESNFK 
301 EIQMHGNKNF GMTIDEVIEE CKLFYFAGQE TTSVLLVWTL ILLSKHVDWQ ERAREEVHQV 
361 FGSNKPDYDA LNQLKWTMI FNEVLRLYPP GITISRTVHE DTKLGNLSLP AGIQLVLPAI 
421 WLHHDNEIWG DDAKEFKPER FSEGVNKATK GKFAYFPFSW GPRICVGLNF AMLEAKMALA 
481 LILQHYAFEL SPSYAHAPHT IITLQPQHGA PLILRKL 



NAME D209-AH12 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 231 

1 ATATGCAACT GAGATTTGAA GAATACCAAC TAACCAAAAT GCAGTTCTTC AGCTTGGTTT 
61 CCATTTTCCT ATTTCTATCT TTCCTCTTTT TGTTAAGGAT ATGGAAGAAC TCCAATAGCC 
121 AAAGCAAAAA GTTGCCACCA GGTCCATGGA AACTACCAAT ACTAGGAAGT ATGCTTCATA 
181 TGGTTGGTGG ACTACCACAC CATGTCCTTA GAGATTTAGC CAAAAAATAT GGACCACTTA 
241 TGCACCTTCA ATTAGGTGAA GTTTCTGCGG TTGTGGTTAC TTCTCCTGAT ACGGCAAAAG 
301 AAGTATTAAA AACTCATGAC ATCGCTTTTG CGTCTAGGCC TAGCCTTTTG GCCCCGGAGA 
361 TTGTCTGTTA CAATAGGTCT GATCTAGCCT TTTGCCCCTA TGGCGACTAT TGGAGACAAA 
421 TGCGTAAAAT ATGTGTCTTG GAAGTGCTCA GTGCCAAGAA TGTTCGGACA TTTAGCTCTA 
481 TTAGGCGGAA TGAAGTTCTT CGTCTCATTA ATTTTATCCG GTCATCTTCT GGTGAACCTA 
541 TTAATGTTAC GGAAAGGATC TTTTTGTTCA CAAGCTCCAT GACATGTAGA TCAGCGTTTG 
601 GGCAAGTGTT CAAAGAGCAA GACAAATTTA TACAACTAAT TAAAGAAGTG ATACTCTTAG 
661 CAGGAGGGTT TGATGTGGCT GACATATTCC CTTGACTGAA GTTTCTTCAT GTGCTCAGTG 
721 GAATGAAGGG TAAGATTATG AATGCACACC ATAAGGTAGA TGCCATTGTT GAGAATGTCA 
781 T CAAT GAGCA CAAGAAAAAT CTTGCAATTG GGAAAACTAA TGGAGCGTTA GGAGGTGAAG 
841 ATTTAATTGA TGTTCTTCTA AGACTTATGA ATGATGGAGG CCTTCAATTT CCTATCACCA 
901 ACGACAACAT CAAAGCCATA ATTTTTGACA TGTTTGCTGC CGGGACAGAG ACTTCATCGT 
961 CAACAATTGT GTGGGCTATG GTAGAAATGG TGAAAAATCC AGCCGTATTC GCGAAAGCTC 
1021 AAGCAGAAGT AAGAGAAGCA TTTAGAGGAA AAGAAACTTT CGATGAAAAT GATGTGGAGG 
1081 AGCTAAACTA CCTAAAGTTA GTAATAAAAG AAACTCTAAG ACTTCATCCA CCGGTTCCAC 
1141 TTTTGCTCCC AAGAGAATGT AGGGAAGAGA CAAATATAAA CGGCTACACT ATTCCTGTAA 
1201 AGACCAAAGT CATGGTTAAT GTTTGGGCTT TGGGAAGAGA TCCAAAATAT TGGAATGACG 
1261 CAGAAACTTT TATGCCAGAG AGATTTGAGC AGTGCTCTAA GGATTTTGTT GGTAATAATT 
1321 TTGAATATCT TCCATTTGGT GGCGGAAGGA GGATTTGTCC TGGGATTTCG TTTGGCTTAG 
1381 CTAATGCTTA TTTGCCATTG GCTCAATTAC TATATCACTT CGATTGGAAA CTCCCTGCTG 
1441 GAATCGAACC AAGCGACTTG GACTTGACTG AGTTGGTTGG AGTAACTGCC GCTAGAAAAA 
1501 GTGACCTTTA CTTGGTTGCG ACTCCTTATC AACCTCCTCA AAAGTGATTT AATGGTTTCA 
1561 AGTTTTTATT TCCTAGCAAA CCCCACTATT GTCCTATCTT TCTTTTGGTG TTTTCGGTTT 
1621 TATCTACTCT AATACATGCA TCTTTTACCA TATAGGAATG TACCATGTTG TCG 

SEQ. ID. NO. 232 

1 MQLRFEEYQL TKMQFFSLVS IFLFLSFLFL LRIWKNSNSQ SKKLPPGPWK LPILGSMLHM 

61 VGGLPHHVLR DLAKKYGPLM HLQLGEVSAV WTSPDTAKE VLKTHDIAFA SRPSLLAPEI 

121 VCYNRSDLAF CPYGDYWRQM RKICVLEVLS AKNVRTFSSI RRNEVLRLIN FIRSSSGEPI 

181 NVTERIFLFT SSMTCRSAFG OVFKEQDKFI QLIKEVILLA GGFDVADI FP SLKFLHVLSG 

241 MKGKIMNAHH KVDAIVENVI NEHKKNLAIG KTNGALGGED LIDVLLRLMN DGGLQFPITN 

301 DNIKAIIFDM FAAGTETSSS TIVWAMVEMV KNPAVFAKAQ AEVREAFRGK ETFDENDVEE 

361 LNYLKLVIKE TLRLHPPVPL LLPRECREET NINGYTIPVK TKVMVNVWAL GRDPKYWNDA 

421 ETFMPERFEQ CSKDFVGNNF EYLPFGGGRR ICPGISFGLA NAYLPLAQLL YHFDWKLPAG 

481 IEPSDLDLTE LVGVTAARKS DLYLVATPYQ PPQK 
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NAME D221-BB8 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 233 

1 GAATTATTTC ACGTGTTGTA TTCCTTGTCT ATGATAGGAA GCTCGTTACC TCAGCGTACA 
61 AACCCCAAAT AAAAAATGAA TTTCCTTGTG GTGTTAGCTT CTCTCTTTCT CTTTGTGTTC 
121 CTAATGAGGA TAAGCAAAGC AAAAAAGCTC CCTCCAGGTC CAAGGAAACT GCCTATAATA 
181 GGAAACCTTC ATCAAATTGG AAAATTACCT CATCGTTCAC TTCAAAAACT TTCTAATGAA 
241 TATGGGGATT TCATTTTCTT GCAATTAGGT TCTGTACCGA CTGTGGTTGT CTCCTCAGCT 
301 GACATTGCCC GAGAGATCTT TAGAACTCAC GACCTTGTTT TCTCAGGCCG TCCTGCTTTA 
361 TATGCTGCCA GAAAACTTTC CTACAATTGC TACAACGTTT CATTTGCACC CTATGGTAAT 
421 TACTGGAGAG AGGCTCGGAA AATTCTAGTG TTGGAGTTGC TAAGTACAAA GAGAGTACAA 
481 AGTTTCGAGG CAATTCGAGA CGAGGAAGTA AGTAGCTTGG TTCAAATTAT CTGTAGTTCC 
541 TTGAGCTCAC CTGTTAACAT AAGCACATTA GCACTATCCT TGGCAAATAA CGTTGTTTGT 
601 CGAGTGGCTT TTGGGAAAGG GAGTGCTGAA GGAGGAAATG ATTATGAGGA TAGGAAGTTT 
661 AATGAAATTC TATATGAGAC ACAAGAATTA TTGGGTGAGT TTAACGTTGC TGATTATTTT 
721 CCTCGGATGG CATGGATTAA CAAAATAAAT GGGTTTGATG AACGATTGGA AAATAATTTT 
781 AGGGAATTGG ATAAGTTTTA TGACAAAGTA ATAGAAGATC ATCTTAATTC ATGTAGCTGG 
841 ATGAAACAAA GGGATGATGA AGACGTTATT GATGTATTGC TTCGAATTCA AAAGGATCCA 
901 AGCCAAGAAA TTCCTCTCAA AGATGATCAC ATTAAGGGCC TTCTTGCGGA TATATTCATA 
961 GCTGGAACTG ATACATCATC AACAACCATA GAATGGGCAA TGTCAGAACT CATAAAAAAT 
1021 CCAAGAGTCT TGAGAAAAGC TCAAGAGGAA GTTAGAGAAG TTTCTAAGGG AAAACAAAAG 
1081 GTCCAAGAAA GTGATCTTTG CAAACTAGAT TACTTGAAAT TGGTCATCAA AGAAACCTTT 
1141 AGACTACACC CACCAGTCCC ATTACTAGTC CCTCGAGTAA CAACAGCCAG CTGCAAAATA 
1201 ATGGAATACG AAATTCCAGT AAATACAAGA GTCTTCATCA ACGCGACAGC AAAT GGGACA 
1261 AATCCAAAAT ACTGGGAAAA TCCATTGACA TTCTTGCCAG AGAGATTCTT GGATAAGGAG 
1321 ATTGATTACA GAGGCAAAAA TTTTGAGTTG TTGCCATTTG GGGCAGGGAG AAGAGGGTGT 
1381 CCAGGAATTA ATTTTTCAAT ACCACTTGTT GAGCTTGCAC TTGCTAATCT ATTGTTTCAT 
1441 TATAATTGGT CACTTCCTGA AGGGATGCTA GCTAAGGATG TTGATATGGA AGAAGCTTTG 
1501 GGGATTACCA TGCACAAGAA ATCTCCCCTT TGCTTAGTAG CTTCTCATTA TACTTGTTGA 
1561 GATTT TAAAA GATTTTAGCA TAGCTATATA TAGCTTGAAG T 

SEQ. ID. NO. 234 

1 MNFLWLASL FLFVFLMRIS KAKKLPPGPR KLPIIGNLHQ IGKLPHRSLQ KLSNEYGDFI 
61 FLQLGSVPTV WSSADIARE IFRTHDLVFS GRPALYAARK LSYNCYNVSF APYGNYWREA 
121 RKILVLELLS TKRVQSFEAI RDEEVSSLVQ IICSSLSSPV NISTLALSLA NNWCRVAFG 
181 KGSAEGGNDY EDRKFNEILY ETQELLGEFN VADYFPRMAW INKINGFDER LENNFRELDK 
241 FYDKVIEDHL NSCSWMKQRD DEDVI DVLLR IQKDPSQEIP LKDDHIKGLL ADIFIAGTDT 
301 SSTTIEWAMS ELIKNPRVLR KAQEEVREVS KGKQKVQESD LCKLDYLKLV IKETFRLHPP 
361 VPLLVPRVTT ASCKIMEYEI PVNTRVFINA TANGTNPKYW ENPLTFLPER FLDKEIDYRG 
421 KNFELLPFGA GRRGCPGINF SIPLVELALA NLLFHYNWSL PEGMLAKDVD MEEALGITMH 
481 KKSPLCLVAS HYTC 
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NAME D222-BH4 
ORGANISM N I COT I ANA TABACUM 

SEQ. ID. ^0^23^^ pj^QpftQTj,QQQ TCTTTGCGGT TATTTCATTC TTTCTACTTC TGTTTTTTCT 
61 TTTCAAATCA TATTTGCCCT CATCGAAAAC AAAGAAAAAT TCTCCACCAT CTCCTTCAAA 
121 GCTTCCGTTA ATCGGTCACT TCCACAAACT AGGCTTACAA CCTCACCGTT CTCTACAAAA 
181 ACTATCAAAT GAACATGGTC CCATGATGAT GCTTCAATTC GGTAGCGTAC CTGTGCTTAT 
241 CGCTTCATCA GCTGAAGCTG CTTCCGAAAT CATGAAAACC CAAGATTTGT CTTTTGCAAA 
301 CAAACCCATT TCAACCATTC CTAGCAAGCT TTTCTTCGGC CCAAAGGACG TTGCCTTCAC 
361 CCCATATGGG GATTACTGGA GGAATGCCAG AAGCATTTGC ATGCTTCAGC TTTTGAACAA 
421 CAAAAGAGTC CAGTCTTTTC GAAAGATAAG GGAAGAAGAG ACTTCTCTTC TTCTCCAGAG 
481 GATTAGGGAA TCGCCAAATT CAGAAGTCGA TTTAACGGAG CTGTTCGTTT CCATGACTAA 
541 CGACATAGTT TGCAGGGTGG CCTTAGGAAG GAAGTATTGT GATGGGGAAG AAGGGAGGAA 
601 ATTCAAGTCT TTGCTGTTAG AGTTTGTGGA ATTGTTGGGA GTTTTTAACA TTGGAGATTA 
661 CATGCCGTGG CTTGCATGGA TGAATCGTTT CAATGGTTTG AATGCCAAAG TGGATAAAGT 
721 GGCGAAAGAG TTTGATGCAT TTTTGGAGGA TGTGATTGAG GAACACGGAG GAAATAAGAA 
781 ATCAGACACT GAAGCTGAAG GGGCAGACTT CGTGGATATA TTATTGCAGG TTCACAAAGA 
841 AAACAAGGCT GGTTTTCAAG TCGAAATGGA TGCAATCAAA GCTATTATCA TGGATATGTT 
901 TGCTGCGGGA ACAGATACAA CTTCCACGCT TCTAGAGTGG ACAATGAACG AGCTCTTAAG 
961 AAATCCAAAA ACATT GAATA AGTTGAGAGA TGAGGTGAGA CAAGTGACTC AAGGGAAGAC 
1021 AGAGGTAACA GAGGATGACT TAGAGAAAAT GCCGTATTTA AGAGCAGCAG TTAAGGAGAG 
1081 TTCCAGGCTA CACTCTCCAG TGCCACTTCT ACCTCGAGAA GCAATTAAGG ATGCAAAGGT 
1141 TTTGGGCTAC GATATAGCTG CAGGGACTCA AGTCCTCGTT TGTCCATGGG CAATCTCAAG 
1201 AGATCCAAAC CTTTGGGAAA ATCCAGAGGA GTTTCAACCT GAAAGATTCT TGGATACTTC 
1261 CATAGATTAC AAAGGCTTAC ATTTCGAGTT AATTCCATTC GGTGCAGGTC GGAGGGGTTG 
1321 CCCTGGCATC ACATTTGCTA AGTTTGTGAA TGAGCTAGCA TTGGCAAGAT TAATGTTCCA 
1381 TTTTGATTTC TCGCTACCAA AAGGAGTTAA GCATGAGGAT TTGGACGTGG AGGAAGCTGC 
1441 TGGAATTACT GTTAGAAGGA AGTTCCCCCT TTTAGCCGTC GCCACTCCAT GCTCGTGATT 
1501 TTTATTTTAG AGCTCATTCT ATGCCTTAAA AACTACTACT AGATAACTGC GTAGTAAATA 
1561 ATGCTTGGTA 



1 MSVFAVISFF LLLFFLFKSY LPSSKTKKNS PPSPSKLPLI GHFHKLGLQP HRSLQKLSNE 
61 HGPMMMLQFG SVPVLIASSA EAASEIMKTQ DLSFANKPIS TIPSKLFFGP KDVAFTPYGD 
121 YWRNARSICM LQLLNNKRVQ SFRKIREEET SLLLQRIRES PNSEVDLTEL FVSMTNDIVC 
181 RVALGRKYCD GEEGRKFKSL LLEFVELLGV FNIGDYMPWL AWMNRFNGLN AKVDKVAKEF 
241 DAFLEDVIEE HGGNKKSDTE AEGADFVDIL LQVHKENKAG FQVEMDAIKA IIMDMFAAGT 
301 DTTSTLLEWT MNELLRNPKT LNKLRDEVRQ VTQGKTEVTE DDLEKMPYLR AAVKESSRLH 
361 SPVPLLPREA IKDAKVLGYD IAAGTQVLVC PWAISRDPNL WENPEEFQPE RFLDTSIDYK 
421 GLHFELIPFG AGRRGCPGIT FAKFVNELAL ARLMFHFDFS LPKGVKHEDL DVEEAAGITV 
481 RRKFPLLAVA TPCS 



FIG. 119 



NAME D224-AF10 
ORGANISM N I COT I ANA TABACUM 

SEQ. ID. "°-237 c ACCTAAAATG GAGAATTCTT GGGTTTTTCT AGCCTTGGCA GGGCTATCTG 
61 CATTAGCTTT TCTCTGTAAA ATAATCACCT GTCGAAGACC GGTTAACCGG AAAATACCAC 
121 CAGGTCCAAA ACCATGGCCC ATCATTGGCA ATTTGAACCT ACTTGGTCCT ATACCACATC 
181 AATCTTTTGA CTTGCTTTCC AAAAAATATG GAGAGTTGAT GCTGCTGAAA TTTGGCTCCA 
241 GGCCAGTTCT TGTTGCTTCA TCTGCTGAAA TGGCAAAACA GTTTTTAAAA GTACATGATG 
301 CTAATTTCGC CTCCCGTCCT ATGCTAGCTG GTGGAAAGTA TACAAGCTAT AACTATTGTG 
361 ACATGACATG GGCACCCTAT GGTCCCTATT GGCGCCAAGC ACGACGAATT TACCTTAACC 
421 AGATATTTAC TCCGAAAAGG CTAGACTCGT TCGAGTACAT TCGTGTTGAA GAAAGGCAGG 
481 CCTTGATTTC CCAGCTGAAT TCCCTTGCTG GAAAGCCATT TTTTCTCAAA GACCATTTGT 
541 CGCGATTTAG CCTCTGCAGC ATGACAAGGA TGGTTTTGAG CAACAAGTAC TTTGGTGAAT 
601 CAACAGTTAG AGTAGAAGAT TTGCAGTACC TGGTAGATCA ATGGTTCTTA CTTAATGGTG 
661 CTTTCAACAT TGGAGATTGG ATTCCATGGC TCAGCTTCTT GGACCTACAA GGCTATGTGA 
721 AACAAATGAA GGCTTTGAAA AGAACTTTTG ATAAGTTCCA CAACATTGTG CTAGATGATC 
781 GCAGGGCTAA GAAGAATGCA GAGAAGAACT TTGTCCCAAA AGACATGGTT GATGTCTTGT 
841 TGAAGATGGC TGAAGATCCT AATCTGGAAG TCAAACTCAC TAATGACTGT GTCAAAGGGT 
901 TAATGCAGGA TTTACTAACT GGAGGAACAG ATAGCTTAAC AGCAGCAGTG CAATGGGCAT 
961 TTCAAGAACT TCTTAGACGG CCAAGGGTTA TTGAGAAGGC AACCGAAGAG CTTGACCGGA 
1021 TTGTCGGGAA AGAGAGATGG GTAGAAGAGA AAGATTGCTC GCAGCTATCT TACGTTGAAG 
1081 CAATCCTCAA GGAAACACTA AGGTTACATC CTCTAGGAAC TATGCTAGCA CCGCATTGTG 
1141 CTATAGAAGA TTGTAACGTG GCTGGTTATG ACATACAGAA AGGAACGACC GTTCTGGTGA 
1201 ATGTTTGGAC CATTGGAAGG GACCCAAAAT ACTGGGATAG AGCACAAGAG TTTCTCCCCG 
1261 AGAGATTCTT AGAGAACGAC ATTGATATGG ACGGACATAA CTTTGCTTTC TTGCCATTTG 
1321 GCTCGGGGCG AAGGAGGTGC CCTGGCTATA GCCTTGGACT TAAGGTTATC CGAGTAACAT 
1381 TAGCCAACAT GTTGCATGGA TTCAACTGGA AATTACCTGA AGGTATGAAG CCAGAAGATA 
1441 TAAGTGTGGA AGAACATTAT GGGCTCACTA CACATCCTAA GTTTCCTGTT CCTGTGATCT 
1501 TGGAATCTAG ACTTTCTTCA GATCTCTATT CCCCCATCAC TTAATCCTAA GTGCTTCCTA 
1561 TTATAGCATC ATATCAATAT CCCTC 

SEQ. ID. N°^^ FLAL AGLSALAFLC KIITCRRPVN RKIPPGPKPW PIIGNLNLLG PIPHQSFDLL 
61 SKKYGELMLL KFGSRPVLVA SSAEMAKQFL KVHDANFASR PMLAGGKYTS YNYCDMTWAP 
121 YGPYWRQARR IYLNQIFTPK RLDSFEYIRV EERQALISOL NSLAGKPFFL KDHLSRFSLC 
181 SMTRMVLSNK YFGESTVRVE DLQYLVDQWF LLNGAFNIGD WIPWLSFLDL QGYVKQMKAL 
241 KRTFDKFHNI VLDDRRAKKN AEKNFVPKDM VDVLLKMAED PNLEVKLTND CVKGLMQDLL 
301 TGGTDSLTAA VQWAFQELLR RPRVIEKATE ELDRIVGKER WVEEKDCSQL SYVEAILKET 
361 LRLHPLGTML APHCAIEDCN VAGYDIQKGT TVLVNWTIG RDPKYWDRAQ EFLPERFLEN 
421 DIDMDGHNFA FLPFGSGRRR CPGYSLGLKV IRVTLANMLH GFNWKLPEGM KPEDISVEEH 
481 YGLTTHPKFP VPVILESRLS SDLYSPIT 
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NAME D224-BD11 
ORGANISM NICOTIANA TABACUM 

SEQ. ID. N0^239 ATCACCTAAA ATGGAGAATT CTTGGGTTTT TCTAGCCTTG GCAGGGCTAT 

61 CTGCATTA.GC TTTTCTCTGT AAAATAATCA CCTGTCGAAG ACCGGTTAAC CGGAAAATAC 
121 CA^GGTCC AAAACCATGG CCCATCATTG GCAATTTGAA CCTACTTGGT CCTATACCAC 
181 ATCAATCTTT TGACTTGCTT TCCAAAAAAT ATGGAGAGTT GATGCTGCTG AAATTTGGCT 
241 cSccagt TCTTGTTGCT TCATCTGCTG AAATGGCAAA ACAGTTTTTA AAAGTACATG 
301 AT^CTAATTT CGCCTCCCGT CCTATGCTAG CTGGTGGAAA GTATACAAGC TATAACTATT 
361 GTGACATGAC ATGGGCACCC TATGGTCCCT ATTGGCGCCA AGCACGACGA CGAATTTACC 
421 TTAACCAGAT ATTTACTCCG AAAAGGCTAG ACTCGTTCGA GTACATTCGT GTTGAAGAAA 
ill GGCAGGCCTT GATTTCCCAG CTGAATTCCC TTGCTGGAAA GCCATTTTTT CTCAAAGACC 
^41 ATTTGTCGCG ATTTAGCCTC TGCAGCATGA CAAGGATGGT TTTGAGCAAC AA^TATTTTG 
601 GTGAATCAAC AGTTAGAGTA GAAGATTTGC AGTACCTGGT AGATCAATGG TTCTTACTTA 
111 CAACATTGGA GATTGGATTC CATGGCTCAG CTTCTTGGAC CTACAAGGCT 

721 ATGTGAAACA AATGAAGGCT TTGAAAAGAA CTTTTGATAA GTTCCACAAC ATTGTGCTAG 

nil aScS gSgaag aatgcagaga agaactttgt cccaaaagac atggttgatg 
111 tcttgttgaa gatggctgaa gatcctaatc tggaagtcaa actcactaat gactgtgtca 

901 AAGGGTTAAT GCAGGATTTA CTAACTGGAG GAACAGATAG CTTAACAGCA GCAGTGCAAT 
111 G^GCATTTCA AGAACTTCTT AGACAGCCAA GGGTTATTGA GAAGGCAACC GAA^CTTG 
1021 ACCGGATTGT CGGGAAAGAG AGATGGGTAG AAGAGAAAGA TTGCTCGCAG CTATCTTACG 
1081 TTGAAGCAAT CCTCAAGGAA ACACTAAGGT TACATCCTCT AGGAACTATG CTAGCACCGC 
1141 ATTGTGCTAT AGAAGATTGT AACGTGGCTG GTTATGACAT ACAGAAAGGA ACGACCGTTC 
1201 TGGTGAATGT TTGGACCATT GGAAGGGACC CAAAATACTG GGATAGAGCA CAAGAGTTTC 
1261 TCCCcS ATTCTTAGAG AACGACATTG ATATGGACGG ACATAACTTT GCTTTCTTGC 
1321 CATTTGGCTC GGGGCGAAGG AGGTGCCCTG GCTATAGCCT TGGACTTAAG GTTATCCGAG 
1381 TAACATTAGC CAACATGTTG CATGGATTCA ACTGGAAATT ACCTGAAGGT ATGAAGCCAG 
[HI AAGMATAAG TGTGGAAGAA CATTATGGGC TCACTACACA TCCTAAGTTT CCTGTTCCTG 
TGATCTTGGA ATCTAGACTT TCTTCAGATC TCTATTCCCC CATCACTTAA TCCTAAGTGC 
1561 TTCCTATTAT AGCAT CAT AT CAATATCCCT C 

SEQ. ID. NO^240^ AGLSALAFLC KIITCRRPVN RKIPPGPKPW PIIGNLNLLG PIPHQSFDLL 

61 SKKYGELMLL KFGSRPVLVA SSAEMAKQFL KVHDANFASR PMLAGGKYTS ™YCDMTWAP 
121 YGPYWRQARR RIYLNQIFTP KRLDSFEYIR VEERQALISQ LNSLAGKPFF LKDHLSRFSL 
181 CSMT^N KYFGESTVRV EDLQYLVDQW FLLNGAFNIG DWIPWLSFLD LO^YVKQMKA 
241 LKRTFDKFHN IVLDDHRAKK NAEKNFVPKD MVDVLLKMAE DPNLEVKLTN DCVKGLMQDL 
301 LTGGTDSLTA AVQWAFQELL RQPRVIEKAT EELDRIVGKE RWVEEKDCSQ MWEAILKE 
361 TLRLHPLGTM IAPHCAIEDC NVAGYDIQKG TTVLWVWTI GRDPKYWDRA MFLPERFLE 
421 NDIDMDGHNF AFLPFGSGRR RCPGYSLGLK VIRVTLANML HGFNWKLPEG MKPEDISVEE 
481 HYGLTTHPKF PVPVILESRL SSDLYSPIT 
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NAME D228-AD7 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 241 

1 TGATAATGCT CTTTCTACTC TTTGTAGCCC TTCCTTTCAT TCTTATTTTT CTTCTTCCTA 
61 AATTCAAAAA TGGTGGAAAT AACAGATTGC CACCAGGTCC TATAGGTTTA CCATTCATTG 
121 GAAATTTGCA TCAATACGAT AGTATAACTC CTCATATCTA TTTTTGGAAA CTTTCAAAAA 
181 AATATGGCAA AATCTTCTCA TTAAAACTTG CTTCTACTAA TGTGGTAGTA GTTTCTTCAG 
241 CAAAATTAGC AAAAGAAGTA TTGAAAAAAC AAGATTTAAT ATTTTGTAGT AGACCATCTA 
301 TTCTTGGCCA ACAAAAACTG TCTTATTATG GTCGTGATAT TGCTTTTAAT GATTATTGGA 
361 GAGAAATGAG AAAAATTTGT GTTCTTCATC TTTTTAGTTT AAAAAAAGTT CAATTATTTA 
421 GTCCAATTCG TGAAGATGAA GTTTTTAGAA TGATTAAGAA AATATCAAAA CAAGCTTCTA 
481 CTTCACAAAT TATTAATTTG AGTAATTTAA TGATTTCATT AACAAGTACA ATTATTTGTA 
541 GAGTTGCTTT TGGTGTTAGG ATTGAAGAAG AAGCACATGC AAGGAAGAGA TTTGATTTTC 
601 TTTTGGCCGA GGCACAAGAA ATGATGGCTA GTTTCTTTGT ATCTGATTTT TTTCCCTTTT 
661 TAAGTTGGAT TGATAAATTA AGTGGATTGA CATATAGACT TGAGAGGAAT TTCAAGGATT 
721 TGGATAATTT TTATGAAGAA CTCATTGAGC AACATCAAAA TCCTAATAAG CCAAAATATA 
781 TGGAAGGAGA TATTGTTGAT CTTTTGCTAC AATTGAAGAA AGAGAAATTA ACACCACTTG 
841 ATCTCACTAT GGAAGATATA AAAGGAATTC TCATGAATGT GTTAGTTGCA GGATCAGACA 
901 CTAGTGCAGC TGCTACTGTT TGGGCAATGA CAGCCTTGAT AAAGAATCCT AAAGCCATGG 
961 AAAAAGTTCA ATTAGAAATC AGAAAATCAG TTGGGAAGAA AGGCATTGTA AATGAAGAAG 
1021 ATGTCCAAAA CATCCCTTAT TTTAAAGCAG TGATAAAGGA AATATTTAGA TTGTATCCAC 
1081 CAGCTCCACT TTTAGTTCCA AGAGAATCAA TGGAAAAAAC CATATTAGAA GGTTATGAAA 
1141 TTCGGCCAAG AACCATAGTT CATGTTAACG CTTGGGCTAT AGCAAGGGAT CCTGAAATAT 
1201 GGGAAAATCC AGATGAATTT ATACCTGAGA GATTTTTGAA TAGCAGTATC GATTACAAGG 
1261 GTCAAGATTT TGAGTTACTT CCATTTGGTG CAGGCAGAAG AGGTTGCCCA GGTATTGCAC 
1321 TTGGGGTTGC ATCCATGGAA CTTGCTTTGT CAAATCTTCT TTATGCATTT GATTGGGAGT 
1381 TGCCTTATGG AGTAAAAAAA GAAGACATCG ACACAAACGT TAGGCCTGGA ATTGCCATGC 
1441 ACAAGAAAAA CGAACTTTGC CTTGTCCCAA AAAATTATTT ATAAATTATA TTGGGACGTG 
1501 GATCTCATGC TAGTTCTGTG CGGTCAGCTA AGCT TATTAT TTTTGGCTCA AATTATGTAT 
1561 ACATAATTAG TACATGTTTA AAATGTATAA ATATAGTAGA ACCATTCTCA TGGTT 



1 MLFLLFVALP FILIFLLPKF KNGGNNRLPP GPIGLPFIGN LHQYDSITPH IYFWKLSKKY 
61 GKIFSLKLAS TNWWSSAK LAKEVLKKQD LIFCSRPSIL GQQKLSYYGR DIAFNDYWRE 
121 MRKICVLHLF SLKKVQLFSP IREDEVFRMI KKISKQASTS QIINLSNLMI SLTSTIICRV 
181 AFGVRIEEEA HARKRFDFLL AEAQEMMASF FVSDFFPFLS WIDKLSGLTY RLERNFKDLD 
241 NFYEELIEQH QNPNKPKYME GDIVDLLLQL KKEKLTPLDL TMEDIKGILM NVLVAGSDTS 
301 AAATVWAMTA LIKNPKAMEK VQLEIRKSVG KKGIVNEEDV QNIPYFKAVI KEIFRLYPPA 
361 PLLVPRESME KTILEGYEIR PRTIVHVNAW AIARDPEIWE NPDEFIPERF LNSSIDYKGQ 
421 DFELLPFGAG RRGCPGIALG VASMELALSN LLYAFDWELP YGVKKEDIDT NVRPGIAMHK 
481 KNELCLVPKN YL 



FIG. 122 



NAME D228-AH8 
ORGANISM N I COT I AN A TABACUM 

SEQ. ID. ^°^^ TGCT CTTTCTACTC TTTGTAGCCC TTCCTTTCAT TCTTATTTTT CTTCTTCCTA 
61 AATTCAAAAA TGGTGGAAAT AACAGATTGC CACCAGGTCC TATAGGTTTA CCATTCATTG 
121 GAAATTTGCA TCAATATGAT AGTATAACTC CTCATATCTA TTTTTGGAAA CTTTCCAAAA 
181 AATATGGCAA AATCTTCTCA TTAAAACTTG CTTCTACTAA TGTGGTAGTA GTTTCTTCAG 
241 CAAAATTAGC AAAAGAAGTA TTGAAAAAAC AAGATTTAAT ATTTTGTAGT AGACCATCTA 
301 TTCTTGGCCA ACAAAAACTG TCTTATTATG GTCGTGATAT TGCTTTTGCA CCTTATAATG 
361 ATTATTGGAG AGAAATGAGA AAAATTTGTG TTCTTCATCT TTTTAGTTTA AAAAAAGTTC 
421 AATTATTTAG TCCAATTCGT GAAGATGAAG TTTTTAGAAT GATTAAGAAA ATATCAAAAC 
481 AAGCTTCTAC TTCACAAATT ATTAATTTGA GTAATTTAAT GATTTCATTA ACAAGTACAA 
541 TTATTTGTAG AGTTGCTTTT GGTGTTAGGT TTGAAGAAGA AGCACATGCA AGGAAGAGAT 
601 TTGATTTTCT TTTGGCCGAG GCACAAGAAA TGATGGCTAG TTTCTTTGTA TCTGATTTTT 
661 TTCCCTTTTT AAGTTGGATT GATAAATTAA GTGGATTGAC ATATAGACTT GAGAGGAATT 
721 TCAAGGATTT GGATAATTTT TATGAAGAAC TCATTGAGCA ACATCAAAAT CCTAATAAGC 
781 CAAAATATAT GGAAGGAGAT ATTGTTGATC TTTTGCTACA ATTGAAGAAA GAGAAATTAA 
841 CACCACTTGA TCTCACTATG GAAGATATAA AAGGAATTCT CATGAATGTG TTAGTTGCAG 
901 GATCAGACAC TAGTGCAGCT GCTACTGTTT GGGCAATGAC AGCCTTGATA AAGAATCCTA 
961 AAGCCATGGA AAAAGTTCAA TTAGAAATCA GAAAATCAGT TGGGAAGAAA GGCATTGTAA 
1021 ATGAAGAAGA TGTCCAAAAC ATCCCTTATT TTAAAGCAGT GATAAAGGAA ATATTTAGAT 
1081 TGTATCCACC AGCTCCACTT TTAGTTCCAA GAGAATCAAT GGAAAAAACC ATATTAGAAG 
1141 GTTATGAAAT TCGGCCAAGA ACCATAGTTC ATGTTAACGC TTGGGCTATA GCAAGGGATC 
1201 CTGAAATATG GGAAAATCCA GATGAATTTA TACCTGAGAG ATTTTTGAAT AGCAGTATCG 
1261 ATTACAAGGG TCAAGATTTT GAGTTACTTC CATTTGGTGC AGGCAGAAGA GGTTGCCCAG 
1321 GTATTGCACT TGGGGTTGCA TCCATGGAAC TTGCTTTGTC AAATCTTCTT TATGCATTTG 
1381 ATTGGGAGTT GCCTTATGGA GTGAAAAAAG AAGACATCGA CACAAACGTT AGGCCTGGAA 
1441 TTGCCATGCA CAAGAAAAAC GAACTTTGCC TTGTCCCAAA AAATTATTTA TAAATTATAT 
1501 TGGGACGTGG ATCTCATGCT AGTTCTGTGC GGTCAGCTAA GCTTATTATT TTTGGCTCAA 
1561 ATTATGTATA CATAATTAGT ACATGTTTAA AATGTATAAA TATAGTAGAA CCATTCTCAT 
1621 GGTT 



, 11 \J . £, *± *1 

1 MLFLLFVALP FILIFLLPKF KNGGNNRLPP GPIGLPFIGN LHQYDSITPH IYFWKLSKKY 
61 GKIFSLKLAS TNWWSSAK LAKEVLKKQD LIFCSRPSIL GQQKLSYYGR DIAFAPYNDY 
121 WREMRKICVL HLFSLKKVQL FSPIREDEVF RMIKKISKQA STSQIINLSN LMISLTSTII 
181 CRVAFGVRFE EEAHARKRFD FLLAEAQEMM ASFFVSDFFP FLSWIDKLSG LTYRLERNFK 
241 DLDNFYEELI EQHQNPNKPK YMEGDIVDLL LQLKKEKLTP LDLTMEDIKG ILMNVLVAGS 
301 DTSAAATVWA MTALIKNPKA MEKVQLEIRK SVGKKGIVNE EDVQNIPYFK AVIKEIFRLY 
361 PPAPLLVPRE SMEKTILEGY EIRPRTIVHV NAWAIARDPE IWENPDEFIP ERFLNSSIDY 
421 KGQDFELLPF GAGRRGCPGI ALGVASMELA LSNLLYAFDW ELPYGVKKED IDTNVRPGIA 
481 MHKKNELCLV PKNYL 
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NAME D235-AB1 
ORGANISM N I COT I ANA TABACUM 
SEO ID NO 245 

1 AAAATTCATA ATGGTTTTTC CCATAGAAGC CTTTGTAGGA CTAGTAACCT TCACATTTCT 
61 CTTATACTTC CTATGGACAA AAAAATCTCA AAAACTTCCA AAACCCTTAC TACCGAAAAT 
121 CCCCGGAGGA TGGCCGGTAA TCGGCCATCT TTTTCACTTC AATAACGACG GCGACGACCG 
181 TCCATTAGCT CGAAAACTCG GAGACTTAGC TGATAAATAC GGCCCCGTTT TCACTTTTCG 
241 GCTAGGTCTT CCCCTTGTGC TAGTTGTAAG CAGTTACGAA GCTATAAAAG ATTGCTTCTC 
301 TACAAATGAC GCCATTTTCT CCAATCGTCC AGCTTTTCTT TACGGCGAAT ACCTTGGCTA 
361 CAATAATACA ATGCTTTTTC TAGCAAATTA CGGACCTTAC TGGCGAAAAA ATCGTAAATT 
421 AGTCATTCAG GAAGTTCTCT CTGCTAGTCG TCTCGAAAAA TTCAAACAAG TGAGATTCAC 
481 CAGAATTCAA ACGAGCATTA AGAATTTATA CACTCGAATT AATGGAAATT CGAGTACGAT 
541 AAATCTAACT GATTGGTTAG AAGAATTGGA TTTTGGTCTG ATCGTGAAAA TGATCGCTGG 
601 GAAAAATTAT GAATCCGGTA AAGGAGATGA ACAAGTGGAA AGATTTAAGA ATGCGTTTAA 
661 GGATTTTATG GTTTTATCAA TGGAATTTGT ATTATGGGAT GCATTTCCAA TTCCATTATT 
721 TAAATGGGTG GATTTTCAAG GTCATATTAA GGCAATGAAA AGGACATTTA AGGATATAGA 
781 TTCTGTTTTT CAGAACTGGT TAGAGGAACA TATTAATAAA AGAGAAAAAA TGGAGGTTGG 
841 TGCAGAAGGG AATGAACAAG ATTTCATTGA TGTGGTGCTT TCAAAATTGA GTAAAGAATA 
901 TCTTGATGAA GGTTACTCTC GTGATACTGT CATTAAAGCA ACAGTTTTTA GTTTGGTCTT 
961 GGATGCAGCA GACACAGTTG CTCTTCACAT AAATTGGGGA ATGACATTAT TGATAAACAA 
1021 TCAAAATGCC TTGATGAAAG CACAAGAAGA GATAGACACA AAAGTTGGTA AGTATAGATG 
1081 GGTAGAAGAG AGTGATATTA AGGATTTAGT ATACCTCCAA GCTATTGTTA AAAAGGTGTT 
1141 ACGATTATAT CCACCAGGAC CTTTGTTAGT ACCACATGAA TATGTAAAGG ATTGTGTTGT 
1201 TAGTGGATAT CACATTCCTA AAGGGACTAG ATTATTCGCA AACGTCATGA AACTGCAGCG 
1261 CGATCCTAAA CTCTTGTCAA ATCCTGATAA GTTCGATCCA GAGAGATTCA TCGCTGGTGA 
1321 TATCGACTTC CGTGGTCACC ACTATGAGTT TATCCCATTT GGTTCTGGAA GACGATCTTG 
1381 TCCGGGGATG ACT TAT G CAT TGCAAGTGGA ACACCTAACA ATGGCACATT TAATCCAGGG 
1441 TTTCAATTAC AAAACTCCAA ATGACGAGGC CTTGGATATG AAGGAAGGTG CAGGCATAAC 
1501 AATACGTAAG GTAAATCCGG TGGAATTGAT AATAACGCCT CGCTTGGCAC CTGAGCTTTA 
1561 CTAAAACCTA AGATCTTTCA TCTTGGTTGA TCATTGTTTA ATACTCCTAG ATAGATGGGT 
1621 ATT CATC 



1 MVFPIEAFVG LVTFTFLLYF LWTKKSQKLP KPLLPKIPGG WPVIGHLFHF NNDGDDRPLA 
61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE AIKDCFSTND AIFSNRPAFL YGEYLGYNNT 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK FKQVRFTRIQ TSIKNLYTRI NGNSSTINLT 
181 DWLEELDFGL IVKMIAGKNY ESGKGDEQVE RFKNAFKDFM VLSMEFVLWD AFPIPLFKWV 
241 DFQGHIKAMK RTFKDIDSVF QNWLEEHINK REKMEVGAEG NEQDFIDWL SKLSKEYLDE 
301 GYSRDTVIKA TVFSLVLDAA DTVALHINWG MTLLINNQNA LMKAQEEIDT KVGKYRWVEE 
361 SDIKDLVYLQ AIVKKVLRLY PPGPLLVPHE YVKDCWSGY HIPKGTRLFA NVMKLQRDPK 
421 LLSNPDKFDP ERFIAGDIDF RGHHYEFIPF GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
481 KTPNDEALDM KEGAGITIRK VNPVELIITP RLAPELY 
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NAME D243-AA2 
ORGANISM N I COT I AN A TABACUM 
SEQ. ID. NO. 247 

1 CAAAAAATCA TTTCTCTCGT CTAAAATGGA TCTTCTCTTA CTAGAGAAGA CCTTAATTGG 
61 TCTTTTCTTT GCCATTTTAA TCGCTTTAAT TGTCTCTAAA CTTCGTTCAA AGCGTTTTAA 
121 GCTTCCTCCA GGACCAATTC CAGTACCAGT TTTTGGTAAT TGGCTTCAAG TTGGTGATGA 
181 TTTAAACCAC AGAAATCTTA CTGATTATGC CAAAAAATTT GGCGATCTTT TCTTGTTAAG 
241 AATGGGTCAA CGTAACTTAG TTGTTGTGTC ATCTCCTGAA TTAGCTAAAG AAGTTTTACA 
301 CACACAAGGT GTTGAATTTG GTTCAAGAAC AAGAAATGTT GTGTTTGATA TTTTTACTGG 
361 AAAAGGTCAA GATATGGTTT TTACTGTATA TGGTGAACAT TGGAGAAAAA TGAGGAGAAT 
421 TATGACTGTA CCATTTTTTA CTAATAAAGT TGTGCAACAG TATAGAGGGG GGTGGGAGTT 
481 TGAGGTGGCA AGTGTAATTG AGGATGTGAA AAAAAATCCT GAATCTGCTA CTAATGGGAT 
541 CGTATTAAGG AGGAGAT TAC AATTAATGAT GTATAATAAT ATGTTTAGGA TTATGTTTGA 
601 TAGGAGATTT GAGAGTGAAG ATGATCCTTT GTTTGTTAAG CTTAAGGCTT TGAATGGTGA 
661 AAGGAGTAGA TTGGCTCAAA GTTTTGAGTA TAATTATGGT GATTTTATTC CAATTTTGAG 
721 GCCTCTTTTG AGAGGTTATT TGAAGATCTG TAAAGAAGTT AAGGAGAAGA GGCTGCAGCT 
781 TTTCAAAGAT TACTTTGTTG ATGAAAGAAA GAAGCTTTCA AATACCAAGA GCTCGGACAG 
841 CAATGCCCTA AAATGTGCGA TTGATCACAT TCTTGAGGCT CAACAGAAGG GAGAG AT C AA 
901 TGAGGACAAC GTTCTTTACA TTGTTGAAAA CATCAATGTT GCTGCAATTG AAACAACATT 
961 ATGGTCAATT GAGTGGGGTA TCGCCGAGCT AGTCAACCAC CCTCACATCC AAAAGAAACT 
1021 GCGCGACGAG ATTGACACAG TTCTTGGACC AGGAGTGCAA GTGACTGAAC CAGACACCCA 
1081 CAAGCTTCCA TACCTTCAGG CTGTGATCAA GGAGGCACTT CGTCTCCGTA TGGCAATTCC 
1141 TCTATTAGTC CCACACATGA ACCTTCACGA CGCAAAGCTT GGCGGGCTTG ATATTCCAGC 
1201 AGAGAGCAAA ATCTTGGTTA ACGCTTGGTG GTTAGCTAAC AACCCGGCTC ATTGGAAGAA 
1261 ACCCGAAGAG TTCAGACCCG AGAGGTTCTT TGAAGAGGAG AAGCATGTTG AGGCCAATGG 
1321 CAATGACTTC AGATATCTTC CGTTTGGCGT TGGTAGGAGG AGCTGCCCTG GAATTATACT 
1381 TGCATTGCCA ATTCTTGGCA TCACTTTGGG ACGTTTGGTT CAGAACTTTG AGCTGTTGCC 
1441 TCCTCCAGGC CAGTCGAAGC TCGACACCAC AGAGAAAGGT GGACAGTTCA GTCTCCACAT 
1501 TTTGAAGCAT TCCACCATTG TGTTGAAACC AAGGTCTTTC TGAACTTTGT GATCTTATTA 
1561 ATTAAGGGGT TCTGAAGAAA TTTGATAGTG TTGG 

SEQ. ID. NO. 248 

1 MDLLLLEKTL IGLFFAILIA LIVSKLRSKR FKLPPGPIPV FVFGNWLQVG DDLNHRNLTD 
61 YAKKFGDLFL LRMGQRNLW VSSPELAKEV LHTQGVEFGS RTRNWFDIF TGKGQDMVFT 
121 VYGEHWRKMR RIMTVPFFTN KWQQYRGGW EFEVASVIED VKKNPESATN GIVLRRRLQL 
181 MMYNNMFRIM FDRRFESEDD PLFVKLKALN GERSRLAQSF EYNYGDFIPI LRPLLRGYLK 
241 ICKEVKEKRL QLFKDYFVDE RKKLSNTKSS DSNALKCAID HILEAQQKGE INEDNVLYIV 
301 ENINVAAIET TLWSIEWGIA ELVNHPHIQK KLRDEIDTVL GPGVQVTEPD THKLPYLQAV 
361 I KEALRLRMA IPLLVPHMNL HDAKLGGLDI PAESKILVNA WWLANNPAHW KKPEEFRPER 
421 FFEEEKHVEA NGNDFRYLPF GVGRRSCPGI ILALPILGIT LGRLVQNFEL LPPPGQSKLD 
481 TTEKGGQFSL HILKHSTIVL KPRSF 
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NAME D244-AD4 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 249 

1 AACATTTTGC AATATAGTTT TCCTAGTCAG TTCTAGCCTC CTTTTCCTTA GAAATAATGG 
61 ATTATCATAT TTCTTTCCAT TTTCAAGCTC TTTTAGGGCT TTTAGCCTTT GTGTTCTTGT 
121 CTATTATCTT ATGGAGAAGA ACACTCACTT CAAGAAAATT AGCCCCTGAA ATCCCAGGGG 
181 CATGGCCTAT TATAGGCCAT CTTCGTCAGC TGAGTGGTAC TGATAAGAAT ATCCCATTTC 
241 CCCGAATATT GGGCGCTTTG GCAGATAAAT ATGGACCTGT CTTCACACTG AGAATAGGGA 
301 TGTACCCCTA TTTGATTGTC AACAATTGGG AAGCAGCTAA GGATTGTCTC ACAACGCATG 
361 ATAAGGACTT CGCTGCCCGA CCAACTTCTA TGGCTGGTGA AAGCATCGGG TACAAGTATG 
421 CGAGGTTTAC TTATGCTAAT TTTGGTCCTT ATTATAACCA AGTGCGCAAA CTAGCCCTAC 
481 AACATGTACC CTCGAGTACT AAACTCGAGA AAATGAAACA CATACGTGTT TCTGAATTGG 
541 AAACTAGCAT CAAAGAATTA TATTCTTTGA CGCTGGGCAA AAACAACATG CAAAAAGTGA 
601 ATATAAGTAA ATGGTTTGAA CAATTGACTT TAAACATAAT CGTGAAGACA ATTTGTGGCA 
661 AGAGATATAG CAACATAGAG GAGGATGAAG AGGCACAACG TTTCAGAAAG GCATTTAAGG 
721 GCATCATGTT TGTTGTAGGG CAAATTGTTT TATATGACGC AATTCCATTC CCATTGTTCA 
781 AATACTTTGA TTTCCAAGGT CATATACAAT TGATGAACAA AATTTATAAA GACTTAGATT 
8 41 CTATTCTTCA AGGATGGTTG GAT GAT CAT A TGATGAACAA GGATGTAAAC AATAAGGATC 
901 AAGATGCCAT AGATGCCATG CTTAAGGTAA CACAACTTAA TGAATTCAAA GCCTATGGTT 
961 TTTCTCAGGC CACTGTGATC AAGTCGACAG TCTTGAGTTT GATCTTAGAT GGAAATGACA 
1021 CAACCGCTGT TCATTTGATA TGGGTAATGT CCTTATTACT GAACAATCCA CATGTTATGA 
1081 AACAAGGCCA AGAAGAGATA GACATGAAAG TGGGTAAAGA GAGGTGGATT GAAGATACTG 
1141 ACATAAAAAA TTTAGTGTAC CTTCAGGCTA TCGTTAAAGA GACATTGCGC TTGTATCCAC 
1201 CTGTTCCTTT TCTTTTACCA CACGAAGCAG TGCAAGATTG TAAAGTGACT GGTTACCACA 
1261 TTCCTAAAGG TACTCGTCTA TATATCAATG CGTGGAAAGT ACATCGCGAT CCTGAAATTT 
1321 GGTCAGAGCC CGAAAAGTTT ATGCCCAATA GATTCTTGAC TAGCAAAGCA AATATAGATG 
1381 CTCGCGGTCA AAATTTTGAA TTTATACCGT TTGGTTCTGG GAGACGGTCA TGTCCAGGGA 
1441 TAGGTTTTGC GACTTTAGTG ACACATCTGA CTTTTGGTCG CTTGCTTCAA GGTTTTGATT 
1501 TTAGTAAGCC ATCAAACACG CCAATTGACA TGACAGAAGG CGTAGGCGTT ACTTTGCCTA 
1561 AGGTTAATCA AGTTGAAGTT CTAATTACCC CTCGTTTACC TTCTAAGCTT TATTTATTTT 
1621 GAAAGTGCAA ATCATCAATC ATGGCTTGAG TAATTAGTTA TACTTTAATA TGTTTCTC 

SEQ. ID. NO. 250 

1 MDYHISFHFQ ALLGLLAFVF LSIILWRRTL TSRKLAPEIP GAWPIIGHLR QLSGTDKNIP 
61 FPRILGALAD KYGPVFTLRI GMYPYLIVNN WEAAKDCLTT HDKDFAARPT SMAGESIGYK 
121 YARFTYANFG PYYNQVRKLA LQHVPSSTKL EKMKHIRVSE LETSIKELYS LTLGKNNMQK 
181 VNISKWFEQL TLNIIVKTIC GKRYSNIEED EEAQRFRKAF KGIMFWGQI VLYDAIPFPL 
241 FKYFDFQGHI QLMNKIYKDL DSILQGWLDD HMMNKDVNNK DQDAIDAMLK VTQLNEFKAY 
301 GFSQATVIKS TVLSLILDGN DTTAVHLIWV MSLLLNNPHV MKQGQEEI DM KVGKERWIED 
361 TDIKNLVYLQ AIVKETLRLY PPVPFLLPHE AVQDCKVTGY HIPKGTRLYI NAWKVHRDPE 
421 IWSEPEKFMP NRFLTSKANI DARGQNFEFI PFGSGRRSCP GIGFATLVTH LTFGRLLQGF 
481 DFSKPSNTPI DMTEGVGVTL PKVNQVEVLI TPRLPSKLYL F 
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NAME D247-AH1 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 251 

1 TGATAATGCT CTTTCTACTC TTTGTAGCCC TTCCTTTCAT TCTTATTTTT CTTCTTCCTA 
61 AATTCAAAAA TGGTGGAAAT AACAGATTGC CACCAGGTCC TATAGGTTTA CCATTCATTG 
121 GAAATTTGCA TCAATATGAT AGTATAACTC CTCATATCTA TTTTTGGAAA CTTTCCAAAA 
181 AATATGGCAA AATCTTCTCA TTAAAACTTG CTTCTACTAA TGTGGTAGTA GTTTCTTCAG 
241 CAAAATTAGC AAAAGAAGTA TTGAAAAAAC AAGATTTAAT ATTTTGTAGT AGACCATCTA 
301 TTCTTGGCCA ACAAAAACTG TCTTATTATG GTCGTGATAT TGCTTTTGCA CCTTATAATG 
361 ATTATTGGAG AGAAATGAGA AAAATTTGTG TTCTTCATCT TTTTAGTTTA AAAAAAGTTC 
421 AATTATTTAG TCCAATTCGT GAAGATGAAG TTTTTAGAAT GATTAAGAAA ATATCAAAAC 
481 AAGCTTCTAC TTCACAAATT ATTAATTTGA GTAATTTAAT GATTTCATTA ACAAGTACAA 
541 TTATTTGTAG AGTTGCTTTT GGTGTTAGGT TTGAAGAAGA AGCACATGCA AGGAAGAGAT 
601 TTGATTTTCT TTTGGCCGAG GCACAAGAAA TGATGGCTAG TTTCTTTGTA TCTGATTTTT 
661 TTCCCTTTTT AAGTTGGATT GATAAATTAA GTGGATTGAC ATATAGACTT GAGAGGAATT 
721 TCAAGGATTT GGATAATTTT TATGAAGAAC TCATTGAGCA ACATCAAAAT CCTAATAAGC 
781 CAAAATATAT GGAAGGAGAT ATTGTTGATC TTTTGCTACA ATTGAAGAAA GAGAAATTAA 
841 CACCACTTGA TCTCACTATG GAAGATATAA AAGGAATTCT CATGAATGTG TTAGTTGCAG 
901 GATCAGACAC TAGTGCAGCT GCTACTGTTT GGGCAATGAC AGCCTTGATA AAGAATCCTA 
961 AAGCCATGGA AAAAGTTCAA TTAGAAATCA GAAAATCAGT TGGGAAGAAA GGCATTGTAA 
1021 ATGAAGAAGA TGTCCAAAAC ATCCCTTATT TTAAAGCAGT GATAAAGGAA ATATTTAGAT 
1081 TGTATCCACC AGCTCCACTT TTAGTTCCAA GAGAATCAAT GGAAAAAACC ATATTAGAAG 
1141 GTTATGAAAT TCGGCCAAGA ACCATAGTTC ATGTTAACGC TTGGGCTATA GCAAGGGATC 
1201 CTGAAATATG GGAAAATCCA GATGAATTTA TACCTGAGAG ATTTTTGAAT AGCAGTACCG 
1261 ATTACAAGGG TCAAGATTTT GAGTTACTTC CATTTGGTGC AGGCAGAAGA GGTTGCCCAG 
1321 GTATTGCACT TGGGGTTGCA TCCATGGAAC TTGCTTTGTC AAATCTTCTT TATGCATTTG 
1381 ATTGGGAGTT GCCTTATGGA GTGAAAAAAG AAGACATCGA CACAAACGTT AGGCCTGGAA 
1441 TTGCCATGCA CAAGAAAAAC GAACTTTGCC TTGTCCCAAA AAATTATTTA TAAATTATAT 
1501 TGGGACGTGG ATCTCAATTT AGTTCTGTGA GGTCAGC 

SEQ. ID. NO. 252 

1 MLFLLFVALP FILIFLLPKF KNGGNNRLPP GPIGLPFIGN LHQYDSITPH IYFWKLSKKY 
61 GKIFSLKLAS TNWWSSAK LAKEVLKKQD LIFCSRPSIL GQQKLSYYGR DIAFAPYNDY 
121 WREMRKICVL HLFSLKKVQL FSPIREDEVF RMIKKISKQA STSQIINLSN LMISLTSTII 
181 CRVAFGVRFE EEAHARKRFD FLLAEAQEMM ASFFVSDFFP FLSWIDKLSG LTYRLERNFK 
241 DLDNFYEELI EQHQNPNKPK YMEGDIVDLL LQLKKEKLTP LDLTMEDIKG ILMNVLVAGS 
301 DTSAAATVWA MTALIKNPKA MEKVQLEIRK SVGKKGIVNE EDVQNIPYFK AVIKEIFRLY 
361 PPAPLLVPRE SMEKTILEGY EIRPRTIVHV NAWAIARDPE IWENPDEFIP ERFLNSSTDY 
421 KGQDFELLPF GAGRRGCPGI ALGVASMELA LSNLLYAFDW ELPYGVKKED IDTNVRPGIA 
481 MHKKNELCLV PKNYL 
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NAME D248-AA6 
ORGANISM N I COT I ANA TABACUM 

SEQ. ID. ^°^ C ^ ATCAT GGCTCTATCX TTCATATTCA TATCCATAAC CCTAATTTTT CTAGTTCATA 
61 AACTCTACCA CCGTCTTAGA TTCAAACTAC CACCAGGTCC GCGGCCGTTA CCGGTGGTCG 
121 GAAACCTCTA CGACATAAAA CCGGTGAGAT TCCGGTGCTT TGCCGATTGG GCCAAAACTT 
181 ACGGTCCGAT TTTCTCAGTA TACTTTGGGT CACAGTTAAA TGTTGTGGTA ACAACAGCTG 
241 AATTAGCTAA AGAAGTATTG AAAGAAAATG ACCAGAATTT AGCAGATAGA TTTAGGACTA 
301 GACCTGCAAA TAATTTGAGC AGAAATGGGA TGGATTTGAT TTGGGCTGAT TATGGGCCTC 
361 ATTATGTGAA AGTAAGGAAG CTCTGTAATC TTGAGCTTTT TACT CCT AAA AGACTTGAAG 
421 CTCTTAGACC TATTAGAGAA GATGAAGTTA CTGCTATGGT TGAAAACATT TTCAAGGATT 
481 GTACTAAGCC T GAT AACACA GGTAAAAGCT TGTTGATAAG AGAGTACTTA GGATCAGTAG 
541 CATTCAACAA CATTACAAGG TTAACATTTG GGAAAAGGTT CATGAACTCA AAAGGT GAGA 
601 TTGATGAGCA AGGTCAAGAA TTCAAGGGTA TTGTCTCTAA TGGCATCAAA ATTGGCGGAA 
661 AACTTCCCTT GGCAGAGTAT GTTCCATGGC TCCGTTGGTT TTTCACAATG GAAAACGAGG 
721 CACTCGTGAA GCACTCTGCA CGTAGAGACC GGTTAACAAG AATGATCATG GATGAACACA 
781 CACTGGCTCG CAAGAAAACT GGT GAT ACTA AGCAGCATTT TGTCGATGCA TTGCTTACTC 
841 TTCAGAAGCA GTATGATCTT AGTGATGACA CTGTTATTGG CCTCCTCTGG GATATGATTA 
901 CAGCAGGAAT GGACACAACA ACCATAACAG TGGAATGGGC AATGGCAGAA CTAGTTAAGA 
961 ACCCAAGAGT GCAACTAAAA GCTCAAGAGG AGCTTGACAG GGTAATCGGA ACGGATCGAA 
1021 TCATGTCAGA AACCGATTTC TCTAAACTTC CTTACCTACA ATGTGTAGCC AAAGAGGCTC 
1081 TAAGGTTGCA CCCTCCAACT CCTCTAATGC TTCCTCATAA GGCCAGTGCC AGTGTCAAAA 
1141 TTGGTGGTTA TGACATTCCT AAGGGGTCCA TCGTGCACGT GAACGTTTGG GCTGTCGCTC 
1201 GTGACCCAGC CGTGTGGAAG AACCCGTTGG AGTTCAGACC AGAGCGCTTC CTTGAGGAAG 
1261 ACGTTGACAT GAAGGGTCAC GACTATCGGT TATTGCCCTT TGGTGCAGGA AGGCGTGTTT 
1321 GCCCCGGTGC ACAACTTGCT ATCAACTTGG TCACATCTAT GTTGGGTCAT TTGTTGCATC 
1381 ATTTTACATG GGCTCCGGCC CCGGGGGTTA ACCCGGAGGA TATTGACTTG GAGGAGAGCC 
1441 CTGGAACAGT AACTTACATG AAAAATCCAA TACAAGCTAT TCCAACTCCA AGATTGCCTG 
1501 CACACTTGTA TGGACGTGTG CCAGTGGATA TGTAAAACAT TTTGTTCTTT CCCTTTTTGG 
1561 TTATATGATG AG 



i . N\J . £. J *i 

1 MALSFIFISI TLIFLVHKLY HRLRFKLPPG PRPLPWGNL YDIKPVRFRC FADWAKTYGP 
61 IFSVYFGSQL NVWTTAELA KEVLKENDQN LADRFRTRPA NNLSRNGMDL IWADYGPHYV 
121 KVRKLCNLEL FTPKRLEALR PIREDEVTAM VENIFKDCTK PDNTGKSLLI REYLGSVAFN 
181 NITRLTFGKR FMNSKGEIDE QGQEFKGIVS NGIKIGGKLP LAEYVPWLRW FFTMENEALV 
241 KHSARRDRLT RMIMDEHTLA RKKTGDTKQH FVDALLTLQK QYDLSDDTVI GLLWDMITAG 
301 MDTTTITVEW AMAELVKNPR VQLKAQEELD RVI GTDRIMS ETDFSKLPYL QCVAKEALRL 
361 HPPTPLMLPH KASASVKIGG YDIPKGSIVH VNVWAVARDP AVWKNPLEFR PERFLEEDVD 
421 MKGHDYRLLP FGAGRRVCPG AQLAINLVTS MLGHLLHHFT WAPAPGVNPE DIDLEESPGT 
481 VTYMKNPIQA IPTPRLPAHL YGRVPVDM 
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NAME D249-AE8 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 255 

1 AATCACTAAT TTTCATGTAC TCTCATAGGT CAAAAGTTTC AACCAAAATC ATGGCTCTAT 
61 CCTTCATATT CATATCCATA ACCCTAATTT TTCTAGTTCA TAAACTCTAC CACCGTCTTA 
121 GATTCAAACT ACCACCAGGT CCGCGGCCGT TACCGGTGGT CGGAAACCTC TACGACATAG 
181 AACCGGTGAG ATTCCGGTGC TTTGCCGATT GGGCCAAAAC TTACGGTCCG ATTTTCTCAG 
241 TATACTTTGG GTCACAGTTA AATGTTGTGG TAACAACAGC TGAATTAGCT AAAGAAGTAT 
301 TGAAAGAAAA TGACCAGAAT TTAGCAGATA GATTTAGGAC TAGACCTGCA AATAATTTGA 
361 GCAGAAATGG GATGGATTTG ATTTGGGCTG ATTATGGGCC TCATTATGTG AAAGTAAGGA 
421 AGCTCTGTAA TCTTGAGCTT TTTACTCCTA AAAGACTTGA AGCTCTTAGA CCTATTAGAG 
481 AAGATGAAGT TACTGCTATG GTTGAAAACA TTTTCAAGGA TTGTACTAAG CCTGATAACA 
541 CAGGTAAAAG CTTGTTGATA AGAGAGTACT TAGGATCAGT AGCATTCAAC AACATTACAA 
601 GGTTAACATT TGGGAAAAGG TTCATGAACT CAAAAGGTGA GATTGATGAG CAAGGTCAAG 
661 AATTCAAGGG TATTGTCTCT AATGGCATCA AAATTGGCGG AAAACTTCCC TTGGCAGAGT 
721 ATGTTCCATG GCTCCGTTGG TTTTTCACAA TGGAAAACGA GGCACTCGTG AAGCACTCTG 
781 CACGTAGAGA CCGGTTAACA AGAATGATCA TGGATGAACA CACACTGGCT CGCAAGAAAA 
841 CTGGTGATAC TAAGCAGCAT TTTGTCGATG CATTGCTTAC TCTTCAGAAG CAGTATGATC 
901 TTAGTGATGA CACTGTTATT GGCCTCCTCT GGGATATGAT TACAGCAGGA ATGGACACAA 
961 CAACCATAAC AGTGGAATGG GCAATGGCAG AACTAGTTAA GAACCCAAGA GTGCAACTAA 
1021 AAGCTCAAGA GGAGCTTGAC AGGGTAATCG GAACGGATCG AATCATGTCA GAAACCGATT 
1081 TCTCTAAACT TCCTTACCTA CAATGTGTAG CCAAAGAGGC TCTAAGGTTG CACCCTCCAA 
1141 CTCCTCTAAT GCTTCCTCAT AGGGCCAGTG CCAGTGTCAA AATTGGTGGT TATGACATTC 
1201 CTAAGGGGTC CATCGTGCAC GTGAACGTTT GGGCTGTCGC TCGTGACCCA GCCGTGTGGA 
1261 AGAACCCGTT GGAGTTCAGA CCAGAGCGCT TCCTTGAGGA AGACGTTGAC ATGAAGGGTC 
1321 ACGACTATCG GTTATTGCCC TTTGGTGCAG GAAGGCGTGT TTGCCCCGGT GCACAACTTG 
1381 CTATCAACTT GGTCACATCT ATGTTGGGTC ATTTGTTGCA TCATTTTACA TGGGCTCCGG 
14 41 CCCCGGGGGT TAACCCGGAG GATATTGACT TGGAGGAGAG CCCTGGAACA GTAACTTACA 
1501 TGAAAAATCC AATACAAGCT ATTCCAACTC CAAGATTGCC TGCACACTTG TATGGACGTG 
1561 TGCCAGTGGA TATGTAAAAC 



1 MYSHRSKVST KIMALSFIFI SITLIFLVHK LYHRLRFKLP PGPRPLPWG NLYDIEPVRF 
61 RCFADWAKTY GPIFSVYFGS QLNVWTTAE LAKEVLKEND QNLADRFRTR PANNLSRNGM 
121 DLIWADYGPH YVKVRKLCNL ELFTPKRLEA LRPIREDEVT AMVENIFKDC TKPDNTGKSL 
181 LIREYLGSVA FNNITRLTFG KRFMNSKGEI DEQGQEFKGI VSNGIKIGGK LPLAEYVPWL 
241 RWFFTMENEA LVKHSARRDR LTRMIMDEHT LARKKTGDTK QHFVDALLTL QKQYDLSDDT 
301 VIGLLWDMIT AGMDTTTITV EWAMAELVKN PRVQLKAQEE LDRVIGTDRI MSETDFSKLP 
361 YLQCVAKEAL RLHPPTPLML PHRASASVKI GGYDIPKGSI VHVNVWAVAR DPAVWKNPLE 
421 FRPERFLEED VDMKGHDYRL LPFGAGRRVC PGAQLAINLV TSMLGHLLHH FTWAPAPGVN 
481 PEDIDLEESP GTVTYMKNPI QAIPTPRLPA HLYGRVPVDM 
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NAME D250-AC11 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 257 

1 ATAATGCTCT TTCTACTCTT TGTAGCCCTT CCTTTCATTC TTATTTTTCT TCTTCCTAAA 
61 TTCAAAAATG GTGGAAATAA CAGATTGCCA CCAGGTCCTA TAGGTTTACC ATTCATTGGA 
121 AATTTGCATC AATATGATAG TATAACTCCT CAT AT CT ATT TTTGGAAACT TTCCAAAAAA 
181 TATGGCAAAA TCTTCTCATT AAAACTTGCT TCTACTAATG TGGTAGTAGT TTCTTCAGCA 
241 AAATTAGCAA AAGAAGTATT GAAAAAACAA GATTTAATAT TTTGTAGTAG ACCATCTATT 
301 CTTGGCCAAC AAAAACTGTC TTATTATGGT CGTGATATTG CTTTTGCACC TTATAATGAT 
361 TATTGGAGAG AAATGAGAAA AATTTGTGTT CTTCATCTTT TTAGTTTAAA AAAAGTTCAA 
421 TTATTTAGTC CAATTCGTGA AGATGAAGTT TTTAGAATGA TTAAGAAAAT ATCAAAACAA 
481 GCTTCTACTT CACAAATTAT TAATTTGAGT AATTTAATGA TTTCATTAAC AAGTACAATT 
541 ATTTGTAGAG TTGCTTTTGG TGTTAGGTTT GAAGAAGAAG CACATGCAAG GAAGAGATTT 
601 GATTTTCTTT TGGCCGAGGC ACAAGAAATG ATGGCTAGTT TCTTTGTATC TGATTTTTTT 
661 CCCTTTTTAA GTTAGATTGA CAAATTAAGT GGATTGACAT ATAGACTTGA GAGGAATTTC 
721 AAGGATTTGG ATAATTTTTA TGAAGAACTC ATTGAGCAAC ATCAAAATCC TAATAAGCCA 
781 AAATATATGG AAG GAGAT AT TGTTGATCTT TTGCTACAAT TGAAGAAAGA GAAATTAACA 
841 CCACTTGATC TCACTATGGA AGATATAAAA GGAATTCTCA TGAATGTGTT AGTTGCAGGA 
901 TCAGACACTA GTGCAGCTGC TACTGTTTGG GCAATGACAG CCTTGATAAA, GAATCCTAAA 
961 GCCATGGAAA AAGTTCAATT AGAAATCAGA AAATCAGTTG GGAAGAAAGG CATTGTAAAT 
1021 GAAGAAGATG TCCAAAACAT CCCTTATTTT AAAGCAGTGA TAAAGGAAAT ATTTAGATTG 
1081 TATCCACCAG CTCCACTTTT AGTTCCAAGA GAATCAATGG AAAAAACCAT ATTAGAAGGT 
1141 TATGAAATTC GGCCAAGAAC CATAGTTCAT GTTAACGCTT GGGCTATAGC AAGGGATCCT 
1201 GAAATATGGG AAAATCCAGA TGAATTTATA CCTGAGAGAT TTTTGAATAG CAGTATCGAT 
1261 TACAAGGGTC AAGATTTTGA GTTACTTCCA TTTGGTGCAG GCAGAAGAGG TTGCCCAGGT 
1321 ATTGCACTTG GGGTTGCATC CATGGAACTT GCTTTGTCAA ATCTTCTTTA TGCATTTGAT 
1381 TGGGAGTTGC CTTATGGAGT GAAAAAAGAA GACATCGACA CAAACGTTAG GCCTGGAATT 
1441 GCCATGCACA AGAAAAACGA ACTTTGCCTT GTCCCAAAAA AATTATTTAT AAATTATATT 
1501 GGGACGTGGA TCTCATGCTA GTTCTGTGCG GTCAGCTAAG CTTA 

SEQ. ID. NO. 258 

1 MLFLLFVALP FILIFLLPKF KNGGNNRLPP GPIGLPFIGN LHQYDSITPH IYFWKLSKKY 
61 GKIFSLKLAS TNWWSSAK LAKEVLKKQD LIFCSRPSIL GQQKLSYYGR DIAFAPYNDY 
121 WREMRKICVL HLFSLKKVQL FSPIREDEVF RMIKKISKQA STSQIINLSN LMISLTSTII 
181 CRVAFGVRFE EEAHARKRFD FLLAEAQEMM ASFFVSDFFP FLS.IDKLSG LTYRLERNFK 
241 DLDNFYEELI EQHQNPNKPK YMEGDIVDLL LQLKKEKLTP LDLTMEDIKG ILMNVLVAGS 
301 DTSAAATVWA MTALIKNPKA MEKVQLEIRK SVGKKGIVNE EDVQNIPYFK AVIKEIFRLY 
361 PPAPLLVPRE SMEKTILEGY EIRPRTIVHV NAWAIARDPE IWENPDEFIP ERFLNSSIDY 
421 KGQDFELLPF GAGRRGCPGI ALGVASMELA LSNLLYAFDW ELPYGVKKED IDTNVRPGIA 
481 MHKKNELCLV PKKLFINYIG TWISC 
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NAME D259-AB9 
ORGANISM NI COT I ANA TABACUM 
SEQ. ID. NO. 259 

1 CACATTGAGT CCTCTCCCAA ATCACTGATT CACCACCAAA AGTACCAACA ATTCAATGGA 
61 AGGTACAAAC TTGACTACAT ATGCAGCAGT ATTTCTTGAT ACTCTGTTTC TTTTGTTCCT 
121 TTCCAAACTT CTTCGCCAGA GGAAACTCAA TTTACCTCCA GGCCCAAAAC CATGGCCGAT 
181 CATCGGAAAC TTAAACCTTA TTGGCAATCT TCCTCATCGC TCAATCCACG AACTCTCCCT 
241 CAAGTACGGA CCCGTTATGC AACTCCAATT CGGGTCTTTC CCCGTTGTAG TTGGATCCTC 
301 CGTCGAAATG GCTAAGATTT TCCTCAAATC CATGGATATT AACTTTGTAG GCAGGCCTAA 
361 AACGGCTGCC GGAAAATACA CAACGTACAA TTATTCCGAT ATTACATGGT CTCCTTACGG 
421 ACCATATTGG CGCCAGGCAC GTAGGATGTG CCTAACGGAA TTATTCAGCA CGAAACGTCT 
481 CGATTCATAC GAGTATATTC GGGCTGAGGA GTTGCATTCT CTTCTCCATA ATTTGAACAA 
541 AATATCAGGG AAACCAATTG TGTTGAAAGA TTATTTGACG ACGTTGAGTT TAAATGTTAT 
601 TAGCAGGATG GTACTGGGGA AAAGGTATTT GGACGAATCC GAGAACTCGT TCGTGAATCC 
661 TGAGGAATTT AAGAAGATGT TGGACGAATT GTTTTTGCTA AATGGTGTAC TTAATATTGG 
721 AGATTCAATT CCATGGATTG ATTTCATGGA TTTGCAAGGT TATGTTAAGA GGATGAAAGT 
781 AGTGAGCAAG AAATTCGACA AGTTTTTAGA GCATGTTATT GATGAGCATA ACATTAGGAG 
841 AAATGGAGTG GAGAATTATG TTGCTAAGGA TATGGTGGAT GTTTTGTTGC AGCTTGCTGA 
901 TGATCCGAAG TTGGAAGTTA AGCTGGAGAG ACATGGAGTC AAAGCATTCA CTCAGGATAT 
961 GCTGGCTGGT GGAACCGAGA GTTCAGCAGT GACAGTGGAG TGGGCAATTT CAGAGCTGCT 
1021 AAAGAAGCCG GAGATTTTCA AAAAGGCTAC AGAAGAATTG GATCGAGTAA TTGGGCAGAA 
1081 TAGATGGGTA CAAGAAAAGG ACATTCCAAA TCTTCCTTAC ATAGAGGCAA TAGTCAAAGA 
1141 GACTATGCGA CTGCACCCCG TGGCACCAAT GTTGGTGCCA CGTGAGTGTC GAGAAGATAT 
1201 TAAGGTAGCA GGCTACGACG TTCAGAAAGG AACTAGGGTT CTCGTGAGTG TATGGACTAT 
1261 TGGAAGAGAC CCTACATTGT GGGACGAGCC TGAGGTGTTC AAGCCGGAGA GATTCCATGA 
1321 AAAGTCCATA GATGTTAAAG GACATGATTA TGAGCTTTTG CCATTTGGAG CGGGGAGAAG 
1381 AATGTGCCCG GGTTATAGCT TGGGGCTCAA GGTGATTCAA GCTAGCTTAG CTAATCTTCT 
1441 ACATGGATTT AACTGGTCAT TGCCTGATAA TATGACTCCT GAGGACCTCA ACATGGATGA 
1501 GATTTTTGGG CTCTCTACAC CTAAAAAATT TCCACTTGCT ACTGTGATTG AGCCAAGACT 
1561 TTCACCAAAA CTTTACTCTG TTTGATTCAG CAGTTCTATG GTTCCGTCAA GATAG 

SEQ. ID. NO. 260 

1 MEGTNLTTYA AVFLDTLFLL FLSKLLRQRK LNLPPGPKPW PIIGNLNLIG NLPHRSIHEL 
61 SLKYGPVMQL QFGSFPWVG SSVEMAKIFL KSMDINFVGR PKTAAGKYTT YNYSDITWSP 
121 YGPYWRQARR MCLTELFSTK RLDSYEYIRA EELHSLLHNL NKISGKPIVL KDYLTTLSLN 
181 VISRMVLGKR YLDESENSFV NPEEFKKMLD ELFLLNGVLN IGDSIPWIDF MDLQGYVKRM 
241 KWSKKFDKF LEHVIDEHNI RRNGVENYVA KDMVDVLLQL ADDPKLEVKL ERHGVKAFTQ 
301 DMLAGGTESS AVTVEWAISE LLKKPEIFKK ATEELDRVIG QNRWVQEKDI PNLPYIEAIV 
361 KETMRLHPVA PMLVPRECRE DIKVAGYDVQ KGTRVLVSVW TIGRDPTLWD EPEVFKPERF 
421 HEKSIDVKGH DYELLPFGAG RRMCPGYSLG LKVIQASLAN LLHGFNWSLP DNMTPEDLNM 
481 DEIFGLSTPK KFPLATVIEP RLSPKLYSV 
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NAME D218A-AC2 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 261 

1 CTTCTTCCTT CCTAACTAAA AATGGAGATT CAGTTTTCTA ACTTAGTTGC ATTCTTGCTC 
61 TTTCTCTCCA GCATCTTTCT TGTATTCAAA AAATGGAAAA CCAGAAAACT AAATTTGCCT 
121 CCTGGTCCAT GGAAATTACC TTTTATTGGA AGTTTACACC ATTTGGCTGT GGCAGGTCCA 
181 CTTCCTCACC ATGGCCTAAA AAATTTAGCC AAACGCTATG GTCCTCTTAT GCATTTACAA 
241 CTTGGACAAA TTCCTACACT CGTCATATCA TCACCTCAAA TGGCAAAAGA AGTACTAAAA 
301 ACTCACGACC TCGCTTTTGC CACTAGACCA AAGCTTGTCG TGGCCGACAT CATTCACTAC 
361 GACAGCACGG ACATAGCACT TTCGCCATAC GGTGAATACT GGAGACAAAT TCGTAAAATT 
421 TGCATATTGG AACTCTTGAG TGCCAAGATG GTCAAGTTTT TTAGCTCGAT TCGCCAAGAT 
481 GAGCTCTCGA AGATGGTTTC ATCTATACGA ACGACGCCCA ATCTTCCAGT CAATCTTACC 
541 GACAAGATTT TTTGGTTTAC GAGTTCGGTA ATTTGTAGAT CAGCTTTAGG GAAGATATGT 
601 GGTGACCAAG ACAAATTGAT CATTTTTATG AGGGAAATAA TATCATTGGC AGGTGGATTT 
661 AGTATTGCTG ATTTTTTCCC TACATGGAAA ATGATTCATG ATATTGATGG TTCAAAATCT 
721 AAACTGGTGA AGGCACATCG TAAGATTGAT GAAATTTTGG AAAATGTGGT AAAT GAGCAC 
781 AAACAGAATC GAGCAGATGG TAAAAAGGGT AATGGTGAAT TTGGTGGAGA AGATCTGATT 
841 GATGTTTTGT TAAGAGTTAG AGAAAGT GGA GAAGTTCAAA TTCCAATCAC AGATGACAAT 
901 ATCAAATCAA TATTAATCGA CATGTTCTCT GCCGGATCGG AAACATCATC GACAACTATA 
961 ATTTGGGCAT TAGCTGAAAT GATGAAGAAA CCAAGTGTTT TAGCAAAGGC ACAAGCT GAA 
1021 GTGAGCCAAG CTTTGAAGGG GAAGAAAATT AGTTTTCAAG AGATTGATAT TGATAAGCTA 
1081 AAGTATTTGA AGTTAGTGAT CAAAGAAACT TTAAGAATGC ACCCTCCAAT TCCTCTGTTA 
1141 GTCCCTAGAG AATGTATGGA AGATACAAAG ATTGATGGTT ACAAT AT AC C TTTCAAAACA 
1201 AGAGTCATTG TTAATGCATG GGCAATTGGA CGAGATCCTC AAAGTTGGGA TGATCCTGAA 
1261 AGCTTTACGC CAGAGAGATT TGAGAATAAT TCTATTGATT TTCTTGGAAA TCATCATCAA 
1321 TTTATTCCAT TTGGTGCAGG AAGAAGGATT TGTCCTGGAA TGCTATTTGG TTTAGCTAAT 
1381 GTTGGACAAC CTTTAGCTCA GTTACTTTAT CACTTCGATT GGAAACTCCC TAATGGACAA 
1441 ACTCACCAAA ATTTCGACAT GACTGAGTCA CCTGGAATTT CTGCTACAAG AAAGGAT GAT 
1501 CTTATTTTGA TTGCCACTCC TGCTCATTCT TGATTAAGTA TTGCTGCTTT TCTATTGGAG 
1561 AATTTTCAAA ATTCATCCAC AATATATAGT GTTTGCTAGA GTTGGTTAGC 

SEQ . ID. NO. 262 

1 MEIQFSNLVA FLLFLSSIFL VFKKWKTRKL NLPPGPWKLP FIGSLHHLAV AGPLPHHGLK 
61 NLAKRYGPLM HLQLGQIPTL VISSPQMAKE VLKTHDLAFA TRPKLWADI IHYDSTDIAL 
121 SPYGEYWRQI RKICILELLS AKMVKFFSSI RQDELSKMVS SIRTTPNLPV NLTDKIFWFT 
181 SSVICRSALG KICGDQDKLI IFMREIISLA GGFSIADFFP TWKMIHDIDG SKSKLVKAHR 
241 KIDEILENW NEHKQNRADG KKGNGEFGGE DLIDVLLRVR ESGEVQIPIT DDNIKSILID 
301 MFSAGSETSS TTIIWALAEM MKKPSVLAKA QAEVSQALKG KKISFQEIDI DKLKYLKLVI 
361 KETLRMHPPI PLLVPRECME DTKIDGYNIP FKTRVIVNAW AIGRDPQSWD DPESFTPERF 
421 ENNSIDFLGN HHQFIPFGAG RRICPGMLFG LANVGQPLAQ LLYHFDWKLP NGQTHQNFDM 
481 TESPGISATR KDDLILIATP AHS 
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NAME D210-BD4 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 263 

1 CTTTCATCAT ATGGCATGAA ATGGGAAATG CTCACAACAG CAAAATTGCA GCAATCTGTT 
61 TGATAATTTT CTTGGTATAT AAAGCATGGG AATTGTTGAA GTGGATATGG ATTAAGCCAA 
121 AGAAACTGGA GAGTTGCCTC AGAAAACAGG GACTCAAAGG AAATtCCTAC GGGCTATTCT 
181 ATGGAGATAT GAAAGAATtG TCCAAAAGTC TCAAGGAAAT CAATTCAAAG CCCATCATCA 
241 ATCTATCAAA TGAAGTAGCC CCAAGAATCA TTCCTTATtA TCTTGAAATC ATCCAAAAAT 
301 ATGGTAAAAG ATGTTTTGTT TGGCAAGGAC CAACCCCCGC AATATTAATA ACAGAGCCAG 
361 AATTAATAAA GGAGATATTT GGTAAGAACT ATGTTTTTCA GAAGCCTAAT AATCCCAACC 
421 CACTGACCAA GTTATTGGCT CGAGGTGTTG TAAGCTACGA GGAAGAAAAA TGGGCAAAAC 
481 ACAGAAAGAT CTTAAATCCT GCCTTTCATA TGGAGAAGTT GAAGCATATG CTACCAGCAT 
541 TTTACTTGAG CTGTAGTGAG ATGCTGAACA AATGGGAGGA GATTATCCCA GTAAAAGAAT 
601 CAAATGAGTT GGACATTTGG CCTCATCTTC AAAGAATGAC AAGTGATGTG ATTTCTCGTG 
661 CTGCCTTTGG TAGTAGCTAC GAAGAAGGAA GAAGAATATT TGAACTTCAA GAAGAACAAG 
721 CTGAGTATCT AACGAAGACA TTCAATTCAG TTTATATCCC AGGTTCCAGA TTTTTTCCCA 
781 ATAAAATGAA CAAAAGAATG AAAGAATGTG AAAAGGAAGT ACGAGAAACA ATTACGTGTC 
841 TAATTGACAA CAGATTAAAG GCAAAAGAAG AAGGCAATGG CAAGGCCCTC AATGATGACC 
901 TATTGGGTAT ATTATTAGAG TCAAATTCTA TAGAAATTGA AGAACATGGT AACAAGAAGT 
961 TTGGAATGAG TATACCTGAA GTAATTGAAG AGTGCAAATT ATTCTATTTT GCTGGCCAAG 
1021 AGACTACATC AGTATTGCTT GTGTGGACAC TGATTTTGTT AGGGAGAAAt cCAGAATGGC 
1081 AGGAACGTGC TAGAGAGGAA GTTTTTCAAG CCTTTGGAAG TGATAAACCA ACTTTTGACG 
1141 AATTATATCG CTTGAAAATT GTGACGATGA TTTTGTACGA GTCTTTAAGG TTATATCCAC 
1201 CAATAGCAAC TCGTACTCGA AGGACTAATG AAGAAACAAA ATTAGGGGAA CTAGATTTAC 
1261 CAAAGGGTGC ACTGCTCTTT ATACCAACAA TCTTATTACA TCTTGACAGG GAAATTTGGG 
1321 GTGAAGATGC AGATGAGTTC AATCCGGAGA GATTTAGCGA AGGGGTGGCA AAGGCAACAA 
1381 AGGGGAAAAT GACATATTTT CCATTTGGTG CAGGACCGCG AAAATGCATT GGGCAAAACT 
1441 TCGCGATTTT GGAAGCAAAA ATGGCTATAG CTATGATTCT ACAACGCTTC TCCTTCGAGC 
1501 TCTCTCCATC TTATACACAC TCTCCATACA CTGTGGTCAC TTTGAAACCC AAATATGGTG 
1561 CTCCCCTAAT AATGCACAGG CTGTAGTCCT GTGAGAATAT GCTATCCGAG G 

SEQ. ID. NO. 264 

1 MGNAHNSKIA AICLIIFLVY KAWELLKWIW IKPKKLESCL RKQGLKGNSY GLFYGDMKEL 
61 SKSLKEINSK PIINLSNEVA PRIIPYYLEI IQKYGKRCFV WQGPTPAILI TEPELIKEIF 
121 GKNYVFQKPN NPNPLTKLLA RGWSYEEEK WAKHRKILNP AFHMEKLKHM LPAFYLSCSE 
181 MLNKWEEIIP VKESNELDIW PHLQRMTSDV ISRAAFGSSY EEGRRIFELQ EEQAEYLTKT 
241 FNSVYIPGSR FFPNKMNKRM KECEKEVRET ITCLIDNRLK AKEEGNGKAL NDDLLGILLE 
301 SNSIEIEEHG NKKFGMSIPE VIEECKLFYF AGQETTSVLL VWTLILLGRN PEWQERAREE 
361 VFQAFGSDKP TFDELYRLKI VTMILYESLR LYPPIATRTR RTNEETKLGE LDLPKGALLF 
421 IPTILLHLDR EIWGEDADEF NPERFSEGVA KATKGKMTYF PFGAGPRKCI GQNFAILEAK 
481 MAIAMI LQRF SFELSPSYTH SPYTWTLKP KYGAPLIMHR L 
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NAME D233-AG7 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 265 

1 CTCATTATCC ATCACCTAAA ATGGAGAATT CTTGGGTTTT TCTAGCCTTG GCAGGGCTAT 
61 CTGCATTAGC TTTTCTCTGT AAAATAATCA CCTGTCGAAG ACCGGTTAAC CGGAAAATAC 
121 CACCAGGTCC AAAACCATGG CCCATCATTG GCAATTTGAA CCTACTTGGT CCTATACCAC 
181 ATCAATCTTT TGACTTGCTT TCCAAAAAAT ATGGAGAGTT GATGCTGCTG AAATTTGGCT 
241 CCAGGCCAGT TCTTGTTGCT TCATCTGCTG AAATGGCAAA ACAGTTTTTA AAAGTACATG 
301 ATGCTAATTT CGCCTCCCGT CCTATGCTAG CTGGTGGAAA GTATACAAGC TATAACTATT 
361 GTGACATGAC ATGGGCACCC TATGGTCCCT ATTGGCGCCA AGCACGACGA ATTTACCTTA 
421 ACCAGATATT TACTCCGAAA AGGCTAGACT CGTTCGAGTA CATTCGTGTT GAAGAAAGGC 
481 AGGCCTTGAT TTCCCAGCTG AATTCCCTTG CTGGAAAGCC ATTTTTTCTC AAAGACCATT 
541 TGTCGCGATT TAGCCTCTGC AGCATGACAA GGATGGTTTT GAGCAACAAG TATTTTGGTG 
601 AATCAACAGT TAGAGTAGAA GATTTGCAGT ACCTGGTAGA TCAATGGTTC TTACTTAATG 
661 GTGCTTTCAA CATTGGAGAT TGGATTCCAT GGCTCAGCTT CTTGGACCTA CAAGGCTATG 
721 TGAAACAAAT GAAGGCTTTG AAAAGAACTT TTGATAAGTT CCACAACATT GTGCTAGATG 
781 ATCACAGGGC TAAGAAGAAT GCAGAGAAGA ACTTTGTCCC AAAAGACATG GTTGATGTCT 
841 TGTTGAAGAT GGCTGAAGAT CCTAATCTGG AAGTCAAACT CACTAATGAC TGTGTCAAAG 
901 GGTTAATGCA GGATTTACTA ACTGGAGGAA CAGATAGCTT AACAGCAGCA GTGCAATGGG 
961 CATTTCAAGA ACTTCTTAGA CAGCCAAGGG TTATTGAGAA GGCAACCGAA GAGCTTGACC 
1021 GGATTGTCGG GAAAGAGAGA TGGGTAGAAG AGAAAGATTG CTCGCAGCTA TCTTACGTTG 
1081 AAGCAATCCT CAAGGAAACA CTAAGGTTAC ATCCTCTAGG AACTATGCTA GCACCGCATT 
1141 GTGCTATAGA AGATTGTAAC GTGGCTGGTT ATGACATACA GAAAGGAACG ACCTTTCTGG 
1201 TGAATGTTTG GACCATTGGA AGGGACCCAA AATACTGGGA TAGAGCACAA GAGTTTCTCC 
1261 CCGAGAGATT TTTAGAGAAC GACATTGATA TGGACGGACA TAACTTTGCT TTCTTGCCAT 
1321 TTGGCTCGGG GCGAAGGAGG TGCCCTGGCT ATAGCCTTGG ACTTAAGGTT ATCCGAGTAA 
1381 CATTAGCCAA GATGTTGCAT GGATTCAACT GGAAATTACC TGAAGGTATG AAGCCAGAAG 
1441 ATATAAGTGT GGAAGAACAT TATGGGCTCA CTACACATCC TAAGTTTCCT GTTCCTGTGA 
1501 TCTTGGAATC TAGACTTTCT TCAGATCTCT ATTCCCCCAT CACTTAATCC TAAGTGCTTC 
1561 CTATTATAGC 

SEQ. ID. NO. 266 

1 MENSWVFLAL AGLSALAFLC KIITCRRPVN RKIPPGPKPW PIIGNLNLLG PIPHQSFDLL 

61 SKKYGELMLL KFGSRPVLVA SSAEMAKQFL KVHDANFASR PMLAGGKYTS YNYCDMTWAP 

121 YGPYWRQARR IYLNQIFTPK RLDSFEYIRV EERQALISQL NSLAGKPFFL KDHLSRFSLC 

181 SMTRMVLSNK YFGESTVRVE DLQYLVDQWF LLNGAFNIGD WIPWLSFLDL QGYVKQMKAL 

241 KRTFDKFHNI VLDDHRAKKN AEKNFVPKDM VDVLLKMAED PNLEVKLTND CVKGLMQDLL 

301 TGGTDSLTAA VQWAFQELLR QPRVIEKATE ELDRIVGKER WVEEKDCSQL SYVEAILKET 

361 LRLHPLGTML APHCAIEDCN VAGYDIQKGT TFLVNVWTIG RDPKYWDRAQ EFLPERFLEN 
421 DIDMDGHNFA FLPFGSGRRR CPGYSLGLKV IRVTLANMLH GFNWKLPEGM KPEDISVEEH 
481 YGLTTHPKFP VPVILESRLS SDLYSPIT 
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NAME D257-AE4 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 267 

1 CACATTGAGT CCTCTCCCAA ATCACTGATT CACCACCAAA AGTACCAACA ATTCAATGGA 
61 AGGTACAAAC TTGACTACAT ATGCAGCAGT ATTTCTTGAT ACTCTGTTTC TTTTGTTCCT 
121 TTCCAAACTT CTTCGCCAGA GGAAACTCAA TTTACCTCCA GGCCCAAAAC CATGGCCGAT 
181 CATCGGAAAC TTAAACCTTA TTGGCAATCT TCCTCATCGC TCAATCCACG AACTCTCCCT 
241 CAAGTACGGA CCCGTTATGC AACTCCAATT CGGGTCTTTC CCCGTTGTAG TTGGATCCTC 
301 CGTCGAAATG GCTAAGATTT TCCTCAAATC CATGGATATT AACTTTGTAG GCAGGCCTAA 
361 AACGGCTGCC GGAAAATACA CAACGTACAA TTATTCCGAT ATTACATGGT CTCCTTACGG 
421 ACCATATTGG CGCCAGGCAC GTAGGATGTG CCTAACGGAA TTATTCAGCA CGAAACGTCT 
481 C GAT T CAT AC GAGTATATTC GGGCTGAGGA GTTGCATTCT CTTCTCCATA ATTT GAACAA 
541 AATATCAGGG AAACCAATTG TGTTGAAAGA TTATTTGACG ACGTTGAGTT TAAATGTTAT 
601 TAGCAGGATG GTACTGGGGA AAAGGTATTT GGACGAATCC GAGAACTCGT TCGTGAATCC 
661 TGAGGAATTT AAGAAGATGT TGGACGAATT GTTTTTGCTA AATGGTGTAC TTAATATTGG 
721 AGATTCAATT CCATGGATTG ATTTCATGGA TTTGCAAGGT TATGTTAAGA GGATGAAAGT 
781 AGTGAGCAAG AAATTCGACA AGTTTTTAGA GCATGTTATT GATGAGCATA ACATTAGGAG 
841 AAATGGAGTG GAGAATTATG TTGCTAAGGA TATGGTGGAT GTTTTGTTGC AGCTTGCTGA 
901 TGATCCGAAG TTGGAAGTTA AGCTGGAGAG ACATGGAGTC AAAGCATTCA CTCAGGATAT 
961 GCTGGCTGGT GGAACCGAGA GTTCAGCAGT GACAGTGGAG TGGGCAATTT CAGAGCTGCT 
1021 AAAGAAGCCG GAGATTTTCA AAAAGGCTAC AGAAGAATTG GATCGAGTAA TTGGGCAGAA 
1081 TAGATGGGTA CAAGAAAAGG ACATTCCAAA TCATCcTTAC ATAGAGGCAA TAGTCAAAGA 
1141 GACTATGCGA CTGCACCCCG TGGCACCAAT GTTGGTGCCA CGTGAGTGTC GAGAAGATAT 
1201 TAAGGTAGCA GGCTACGACG TTCAGAAAGG AACTAGGGTT CTCGTGAGTG TATGGACTAT 
1261 TGGAAGAGAC CCTACATTGT GGGACGAGCC TGAGGTGTTC AAGCCGGAGA GATTCCATGA 
1321 AAAGTCCATA GATGTTAAAG GACATGATTA TGAGCTTTTG CCATTTGGAG CGGGGAGAAG 
1381 AATGTGCCCG GGTTATAGCT TGGGGCTCAA GGTGATTCAA GCTAGCTTAG CTAATCTTCT 
1441 ACATGGATTT AACTGGTCAT TGCCTGATAA TATGACTCCT GAGGACCTCA ACATGGATGA 
1501 GATTTTTGGG CTCTCTACAC CTAAAAAATT TCCACTTGCT ACTGTGATTG AGCCAAGACT 
1561 TTCACCAAAA CTTTACTCTG TTTGATTCAG CAGTTCTATG GATCCGTCAA GATAGAC 

SEQ. ID. NO. 268 

1 MEGTNLTTYA AVFLDTLFLL FLSKLLRQRK LNLPPGPKPW PIIGNLNLIG NLPHRSIHEL 
61 SLKYGPVMQL QFGSFPVWG SSVEMAKIFL KSMDINFVGR PKTAAGKYTT YNYSDITWSP 
121 YGPYWRQARR MCLTELFSTK RLDSYEYIRA EELHSLLHNL NKISGKPIVL KDYLTTLSLN 
181 VISRMVLGKR YLDESENSFV NPEEFKKMLD ELFLLNGVLN IGDSIPWIDF MDLQGYVKRM 
241 KWSKKFDKF LEHVIDEHNI RRNGVENYVA KDMVDVLLQL ADDPKLEVKL ERHGVKAFTQ 
301 DMLAGGTESS AVTVEWAISE LLKKPEIFKK ATEELDRVIG QNRWVQEKDI PNHPYIEAIV 
361 KETMRLHPVA PMLVPRECRE DIKVAGYDVQ KGTRVLVSVW TIGRDPTLWD EPEVFKPERF 
421 HEKSIDVKGH DYELLPFGAG RRMCPGYSLG LKVIQASLAN LLHGFNWSLP DNMTPEDLNM 
481 DEIFGLSTPK KFPLATVIEP RLSPKLYSV 
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NAME D268-AE2 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 269 

1 TGCAATATAG TTTTCCTAGT CAGTTCTAGC CTCCTTTTCC TTAGAAATAA TGGATTATCA 
61 TATTTCTTTC CATTTTCAAG CTCTTTTAGG GCTTTTAGCC TTTGTGTTCT TGTCTATTAT 
121 CTTATGGAGA AGAACACTCA CTTCAAGAAA ATTAGCCCCT GAAATCCCAG GGGCATGGCC 
181 TATTATAGGC CATCTTCGTC AGCTGAGTGG TACT GAT AAG AATATCCCAT TTCCCCGAAT 
241 ATTGGGCGCT TTGGCAGATA AATATGGACC TGTCTTCACA CTGAGAATAG GGATGTACCC 
301 CTATTTGATT GTCAACAATT GGGAAGCAGC TAAGGATTGT CTCACAACGC ATGATAAGGA 
361 CTTCGCTGCC CGACCAACTT CTATGGCTGG TGAAAGCATC GGGTACAAGT ATGCGAGGTT 
421 TACTTATGCT AATTTTGGTC CTTATTATAA CCAAGTGCGC AAACTAGCCC TACAACATGT 
481 ACTCTCGAGT ACTAAACTCG AGAAAATGAA ACACATACGT GTTTCTGAAT TGGAAACTAG 
541 CATCAAAGAA TTATATTCTT TGACGCTGGG CAAAAACAAC ATGCAAAAAG TGAATATAAG 
601 TAAATGGTTT GAACAATTGA CTTTAAACAT AATCGTGAAG ACAATTTGTG GCAAGAGATA 
661 TAGCAACATA GAGGAGGATG AAGAGGCACA ACGTTTCAGA AAGGCATTTA AGGGCATCAT 
721 GTTTGTTGTA GGGCAAATTG TTTTATATGA CGCAATTCCA TTCCCATTGT TCAAATACTT 
781 TGATTTCCAA GGTCATATAC AATTGATGAA CAAAATTTAT AAAGACTTAG ATTCTATTCT 
841 TCAAGGATGG TTGGATGATC ATATGATGAA CAAGGATGTA AACAATAAGG ATCAAGATGC 
901 CATAGATGCC ATGCTTAAGG TAACACAACT TAATGAATTC AAAGCCTATG GTTTTTCTCA 
961 GGCCACTGTG ATCAAGTCGA CAGTCTTGAG TTTGATCTTA GATGGAAATG ACACAACCGC 
1021 TGTTCATTTG ATATGGGTAA TGTCCTTATT ACTGAACAAT CCACATGTTA TGAAACAAGG 
1081 CCAAGAAGAG ATAGACATGA AAGTGGGTAA AGAGAGGTGG ATTGAAGATA CTGACATAAA 
1141 AAATTTAGTG TACCTTCAGG CTATCGTTAA AGAGACATTG CGCTTGTATC CACCTGTTCC 
1201 TTTTCTTTTA CCACACGAAG CAGTGCAAGA TTGTAAAGTG ACTGGTTACC ACATTCCTAA 
1261 AGGTACTCGT CTATATATCA ATGCGTGGAA AGTACATCGC GATTCTGAAA TTTGGTCAGA 
1321 GCCCGAAAAG TTTATGCCCA ATAGATTCTT GACTAGCAAA GCAAATATAG ATGCTCGCGG 
1381 TCAAAATTTT GAATTTATAC CGTTTGGTTC TGGGAGACGG TCATGTCCAG GGTTAGGTTT 
1441 TGCGACTTTA GTGACACATC TGACTTTTGG TCGCTTGCTT CAAGGTTTTG ATTTTAGTAA 
1501 GCCATCAAAC ACGCCAATTG ACATGACAGA AGGCGTAGGC GTTACTTTGC CTAAGGTTAA 
1561 TCAAGTTGAA GTTCTAATTA CCCCTCGTTT ACCTTCTAAG CTTTATTTAT TTTGAAAGTG 
1621 CAAATCATCA ATCATGGGTT GAGTAATTAG TGATACT 

SEQ. ID. NO. 270 

1 MDYHISFHFQ ALLGLLAFVF LSIILWRRTL TSRKLAPEIP GAWPIIGHLR QLSGTDKNIP 
61 FPRILGALAD KYGPVFTLRI GMYPYLIVNN WEAAKDCLTT HDKDFAARPT SMAGESIGYK 
121 YARFTYANFG PYYNQVRKLA LQHVLSSTKL EKMKHIRVSE LETSIKELYS LTLGKNNMQK 
181 VNISKWFEQL TLNIIVKTIC GKRYSNIEED EEAQRFRKAF KGIMFWGQI VLYDAIPFPL 
241 FKYFDFQGHI QLMNKIYKDL DSILQGWLDD HMMNKDVNNK DQDAIDAMLK VTQLNEFKAY 
301 GFSQATVIKS TVLSLILDGN DTTAVHLIWV MSLLLNNPHV MKQGQEEIDM KVGKERWIED 
361 TDIKNLVYLQ AIVKETLRLY PPVPFLLPHE AVQDCKVTGY HIPKGTRLYI NAWKVHRDSE 
421 IWSEPEKFMP NRFLTSKANI DARGQNFEFI PFGSGRRSCP GLGFATLVTH LTFGRLLQGF 
481 DFSKPSNTPI DMTEGVGVTL PKVNQVEVLI TPRLPSKLYL F 
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NAME D283-AC1 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 271 

1 AGAGAGTGAA AATGGACGCA CTACTTCAAA TGACAGTAAC AGCATCTTGT GCTGCCATAG 
61 TAATTACTCT GCTGGTGTGT ATATGGAGAG TGCTGAACTG GATTTGGTTC AGACCAAAGA 
121 AATTGGAGTT GTTGTTGAGA AAACAAGGTT TGGAAGGAAA TTCTTACAAG GTTTTGTATG 
181 GGGACATGAA AGAGTTTTCT GGGATGATTA AGGAAGCATA CTCAAAGCCT ATGAGTCTAT 
241 CTGATGATGT AGCACCAAGA CTGATGCCTT TCTTTCTTGA AACCATCAAA AAATATGGAA 
301 AAAGATCCTT TATATGGTTT GGTCCAAGAC CACTAGTATT GATTATGGAT CCTGAGCTTA 
361 TAAAGGAAGT ACTCTCAAAA ATCCATCTGT ATCAAAAGCC TGGTGGAAAT CCATTAGCAA 
421 CACTATTGGT ACAAGGAATA GCAACCTATG AGGAAGACAA ATGGGCCAAA CATAGAAAAA 
481 TCATCAATCC CGCTTTCCAT CTAGAGAAGC TAAAGCTTAT GCTTCCAGCA TTTCGCTTAA 
541 GCTGTAGTGA GATGCTGAGC AAATGGGAAG ACATTGTTTC AGCTGATAGC T CACAT GAGA 
601 TAGATGTATG GTCTCACCTT GAGCAATTGA CTTGCGATGT GATCTCTCGG ACAGCTTTTG 
661 GCAGTAGTTA TGAAGAAGGT AGAAAGATTT TTGAACTTCA AAAGGAACAA GCTCAGTATC 
721 TTGTGGAAGT TTTCCGCTCC GTTTATATCC CAGGAAGGAG ATTTTTGCCA ACAAAGAGGA 
781 ATAGAAGAAT GAAGGAAATA AAAAAGGATG TCCGGGCATC AATTAAAGGT ATTATTGATA 
841 AAAGATTGAA GGCAATGAAA GCAGGGGACA CCAATAATGA GGATCTATTG GGTATATTAC 
901 TGGAATCGaA TATTAAAGAA ATTGAACAGC ACGGAAACAA GGATTTTGGA ATGAGCATTG 
961 AAGAAGTCAT TGAAGAATGC AAGTTATTCT ATTTTGCTGG CCAAGAAACT ACATCAGTGT 
1021 TACTCCTATG GTCTCTAGTG TTGTTGAGCA GGTATCAAGA TTGGCAGGCA CGGGCCAGAG 
1081 AAGAAATCTT GCAAGTCTTT GGCAGTCGAA AACCAGATTT TGACGGATTA AATCATCTAA 
1141 AAATTGTGAC AATGATCTTG TACGAGTCTT TAAGGCTGTA TCCCTCACTA ATAACACTTA 
1201 CCCGCCGGTG TAATGAAGAC ATT GT ATT AG GAGAACTATC TCTACCAGCT GGTGTTCTAG 
1261 TCTCTTTGCC ATTGATTTTG TTGCATGATG ATGAAGAGAT ATGGGGTGAA GATGCAAAGG 
1321 AGTTCAAACC AGAGAGATTT AGAGAAGGAA TATCAAGTGC AACAAAGGGT CAACT CACAT 
1381 ATTTTCCATT TAGCTGGGGT CCTAGAATAT GTATTGGACA AAATTTTGCC ATGTTAGAAG 
1441 CAAAGATGGC TCTGTCTATG ATCCTGCAAC GCTTCTCTTT TGAACTGTCT CCGTCTTATG 
1501 CACATGCCCC TCGGTCCATA ATAACCGTTC AGCCTCAGTA TGGTGCTCCA CTTATTTTCC 
1561 ACAAACTATA ATTTTGGTAC TTCTACTAAT ATTTTAGGGT TTATTCAGAC TCAAAAAAAA 

SEQ. ID. NO. 272 

1 MTVTASCAAI VITLLVCIWR VLNWIWFRPK KLELLLRKQG LEGNSYKVLY GDMKEFSGMI 

61 KEAYSKPMSL SDDVAPRLMP FFLETIKKYG KRSFIWFGPR PLVLIMDPEL IKEVLSKIHL 

121 YQKPGGNPLA TLLVQGIATY EEDKWAKHRK IINPAFHLEK LKLMLPAFRL SCSEMLSKWE 

181 DIVSADSSHE IDVWSHLEQL TCDVISRTAF GSSYEEGRKI FELQKEQAQY LVEVFRSVYI 

241 PGRRFLPTKR NRRMKEIKKD VRASIKGIID KRLKAMKAGD TNNEDLLGIL LESNIKEIEQ 

301 HGNKDFGMSI EEVIEECKLF YFAGQETTSV LLLWSLVLLS RYQDWQARAR EEILQVFGSR 

361 KPDFDGLNHL KIVTMILYES LRLYPSLITL TRRCNEDIVL GELSLPAGVL VSLPLILLHH 

421 DEEIWGEDAK EFKPERFREG ISSATKGQLT YFPFSWGPRI CIGQNFAMLE AKMALSMILQ 

481 RFSFELSPSY AHAPRSIITV QPQYGAPLIF HKL 
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NAME D244-AB6 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 273 

1 TGCAATATAG TTTTCCTAGT CAGTTCTAGC CTCCTTTTCC TTAGAAATAA TGGATTATCA 
61 TATTTCTTTC CATTTTCAAG CTCTTTTAGG GCTTTTAGCC TTTGTGTTCT TGTCTATTAT 
121 CTTATGGAGA AGAACACTCA CTTCAAGAAA ATTAGCCCCT GAAATCCCAG GGGCATGGCC 
181 TATTATAGGC CATCTTCGTC AGCTGAGTGG TACTGATAAG AATATCCCAT TTCCCCGAAT 
241 ATTGGGCGCT TTGGCAGATA AATATGGACC TGTCTTCACA CT GAGAATAG GGATGTACCC 
301 CTATTTGATT GTCAACAATT GGGAAGCAGC TAAGGATTGT CTCACAACGC ATGATAAGGA 
361 CTTggCTGCC CGACCAACTT CTATGGCTGG TGAAAGCATC GGGTACAAGT ATGCGAGGTT 
421 TACT TAT GCT AATTTTGGTC CTTATTATAA CCAAGTGCGC AAACTAGCCC TACAACATGT 
481 ACTCTCGAGT ACTAAACTCG AGAAAATGAA ACACATACGT GTTTCTGAAT TGGAAACTAG 
541 CATCAAAGAA TTATATTCTT TGACGCTGGG CAAAAACAAC ATGCAAAAAG TGAATATAAG 
601 TAAATGGTTT GAACAATTGA CTTTAAACAT AATCGTGAAG ACAATTTGTG GCAAGAGATA 
661 TAGCAACATA GAGGAGGATG AAGAGGCACA ACGTTTCAGA AAGGCATTTA AGGGCATCAT 
721 GTTTGTTGTA GGGCAAATTG TTTTATATGA CGCAATTCCA TTCCCATTGT TCAAATACTT 
781 TGATTTCCAA GGT CATATAC AATTGATGAA CAAAATTTAT AAAGACT TAG ATTCTATTCT 
841 TCAAGGATGG TTGGATGATC ATATGATGAA CAAGGATGTA AACAATAAGG ATCAAGATGC 
901 CATAGATGCC ATGCTTAAGG TAACACAACT TAATGAATTC AAAGCCTATG GTTTTTCTCA 
961 GGCCACTGTG ATCAAGTCGA CAGTCTTGAG TTTGATCTTA GATGGAAATG ACACAACCGC 
1021 TGTTCATTTG ATATGGGTAA TGTCCTTATT ACTGAACAAT CCACATGTTA TGAAACAAGG 
1081 CCAAGAAGAG AT AG AC AT GA AAGTGGGTAA AGAGAGGTGG ATTGAAGATA CTGACATAAA 
1141 AAATTTAGTG TACCTTCAGG CTATCGTTAA AGAGACATTG CGCTTGTATC CACCTGTTCC 
1201 TTTTCTTTTA CCACACGAAG CAGTGCAAGA TTGTAAAGTG ACTGGTTACC ACATTCCTAA 
1261 AGGTACTCGT CTATATATCA ATGCGTGGAA AGTACATCGC GATCCTGAAA TTTGGTCAGA 
1321 GCCCGAAAAG TTTATGCCCA ATAGATTCTT GACTAGCAAA GCAAATATAG ATGCTCGCGG 
1381 TCAAAATTTT GAATTTATAC CGTTTGGTTC TGGGAGACGG TCATGTCCAG GGATAGGTTT 
1441 TGCGACTTTA GTGACACATC TGACTTTTGG TCGCTTGCTT CAAGGTTTTG ATTTTAGTAA 
1501 GCCATCAAAC ACGCCAATTG ACATGACAGA AGGCGTAGGC GTTACTTTGC CTAAGGTTAA 
1561 TCAAGTTGAA GTTCTAATTA CCCCTCGTTT ACCTTCTAAG CTTTATTTAT TTTGAAGGTG 
1621 CAAATCATCA ATCATGGCTT GAGTAATTAG TTATACTTTA ATATGTTTCT C 



SEQ. ID. NO. 274 

1 MDYHISFHFQ ALLGLLAFVF LSIILWRRTL TSRKLAPEIP GAWPIIGHLR QLSGTDKNIP 

61 FPRILGALAD KYGPVFTLRI GMYPYLIVNN WEAAKDCLTT HDKDLAARPT SMAGESIGYK 

121 YARFTYANFG PYYNQVRKLA LQHVLSSTKL EKMKHIRVSE LETSIKELYS LTLGKNNMQK 

181 VNISKWFEQL TLNIIVKTIC GKRYSNIEED EEAQRFRKAF KGIMFWGQI VLYDAIPFPL 

241 FKYFDFQGHI QLMNKIYKDL DSILQGWLDD HMMNKDVNNK DQDAIDAMLK VTQLNEFKAY 

301 GFSQATVIKS TVLSLILDGN DTTAVHLIWV MSLLLNNPHV MKQGQEEIDM KVGKERWIED 

361 TDIKNLVYLQ AIVKETLRLY PPVPFLLPHE AVQDCKVTGY HIPKGTRLYI NAWKVHRDPE 

421 IWSEPEKFMP NRFLTSKANI DARGQNFEFI PFGSGRRSCP GIGFATLVTH LTFGRLLQGF 

481 DFSKPSNTPI DMTEGVGVTL PKVNQVEVLI TPRLPSKLYL F 
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NAME D205-BE9 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 275 

1 TTTGATTCAA CCATGGAGAA CCAATACTCC TACTCATTCT CTTCCTACTT CTACTTAGCT 
61 ATAGTACTGT TTCTTCTTCC AATTTTGGTC AAATATTTCT TCCATCGGAG AAGAAATTTA 
121 CCTCCAAGTC CATTTTCTCT TCCAATAATT GGTCACCTTT ACCTTCTCAA GAAAACTCTC 
181 CATCTCACTC TAACATCCTT ATCAGCTAAA TATGGTCCTG TTTTATACCT CAAATTGGGC 
241 TCTATGCCTG TGATTGTTGT GTCCTCACCA TCTGCTGTTG AAGAATGTTT AACCAAGAAT 
301 GATATCATAT TCGCAAATAG GCCCAAGACC GTGGCTGGTG ACAAGTTTAC CTACAATTAT 
361 ACTGTTTATG TTTGGGCACC CTATGGCCAA CTTTGGAGAA TTCTTCGCCG ATTAACTGTC 
421 GTTGAACTCT TCTCTTCACA TAGCCTACAG AAAACTTCTA TCCTTAGAGA TCAAGAAGTT 
481 GCAATATTTA TCCGTTCGTT ATACAAATTC TCAAAGGATA GTAGCAAAAA AGTCGATTTG 
541 ACCAACTGGT CTTTTACTTT GGTTTTCAAT CTTATGACCA AAATTATTGC TGGGAGACAT 
601 ATTGTGAAGG AGGAAGATGC TGGCAAGGAA AAGGGCATTG AAATTATTGA AAAACTTAGA 
661 GGGACTTTCT TAGTAACTAC ATCATTCTTG AATATGTGTG ATTTCTTGCC AGTATTCAGG 
721 TGGGTTGGTT ACAAAGGGCA GGAGAAGAAG ATGGCCTCAA TTCACAATAG AAGAAATGAA 
781 TTCTTGAACA GCTTGCTTGA TGAATTTCGA CACAAGAAAA GTAGTGCTTC ACAATCTAAC 
841 ACAACTGTTG GAAACATGGA GAAGAAAACC ACACTGATTG AAAAGCTCTT GTCTCTTCAA 
901 GAATCAGAGC CTGAATTCTA CACTGATGAT ATCATCAAAA GTATTATGCT GGTAGTTTTT 
961 GTTGCAGGAA CAGAGACCTC ATCAACAACC ATCCAATGGG TAATGAGGCT TCTTGTAGCT 
1021 CACCCTGAGG CATTGTATAA GCTACGAGCT GACATTGACA GTAAAGTTGG GAATAAGCGC 
1081 TTGCTGAATG AATCAGACCT CAACAAGCTT CCGTATTTGC ATTGTGTTGT TAATGAGACA 
1141 ATGAGATTAT ACACTCCGAT ACCACTTTTA TTGCCTCATT ATTCAACTAA AGATTGTATT 
1201 GTGGAAGGAT ATGATGTACC AAAACATACA ATGTTGTTTG TCAACGCTTG GGCCATTCAC 
1261 AGGGATCCCA AGGTATGGGA GGAGCCTGAC AAGTTCAAGC CAGAGAGATT TGAGGCAACA 
1321 GAAGGGGAAA CAGAAAGGTT CAATTACAAG CTTGTACCAT TTGGAATGGG GAGAAGAGCG 
1381 TGCCCTGGAG CTGATATGGG GTTGCGAGCA GTTTCTTTGG CATTAGGTGC ACTTATTCAA 
1441 TGCTTTGACT GGCAAATTGA GGAAGCGGAA AGCTTGGAGG AAAGCTATAA TTCTAGAATG 
1501 ACTATGCAGA ACAAGCCTTT GAAGGTTGTC TGCACTCCAC GCGAAGATCT TGGCCAGCTT 
1561 CTATCCCAAC TCTAAGGCAA TTTATCAATG CCAAACGTAA TCTTCATCTA CCACTATG 

SEQ. ID. NO. 276 

1 MENQYSYSFS SYFYLAIVLF LLPILVKYFF HRRRNLPPSP FSLPIIGHLY LLKKTLHLTL 

61 TSLSAKYGPV LYLKLGSMPV IWSSPSAVE ECLTKNDIIF ANRPKTVAGD KFTYNYTVYV 

121 WAPYGQLWRI LRRLTWELF SSHSLQKTSI LRDQEVAIFI RSLYKFSKDS SKKVDLTNWS 

181 FTLVFNLMTK IIAGRHIVKE EDAGKEKGIE IIEKLRGTFL VTTSFLNMCD FLPVFRWVGY 

241 KGQEKKMASI HNRRNEFLNS LLDEFRHKKS SASQSNTTVG NMEKKTTLIE KLLSLQESEP 

301 EFYTDDIIKS IMLWFVAGT ETSSTTIQWV MRLLVAHPEA LYKLRADIDS KVGNKRLLNE 

361 SDLNKLPYLH CWNETMRLY TPIPLLLPHY STKDCIVEGY DVPKHTMLFV NAWAIHRDPK 

421 VWEEPDKFKP ERFEATEGET ERFNYKLVPF GMGRRACPGA DMGLRAVSLA LGALIQCFDW 

481 QIEEAESLEE SYNSRMTMQN KPLKWCTPR EDLGQLLSQL 
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NAME D136-AF4 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 277 

1 CCTTTTTAAG ATGTATTTAA GATTTAAGAT TTAAGATGAA GCAACTGAGG TAAGTCCTTT 
61 CAAGGAGTAG TTGTCACTTC TGAGAATGGA GATGATGTAC AGCATAATAG CAGCAGCCAG 
121 TATTGCAATT ATCTTGGTAT ATACATGGAA AGTGTTGAAT TGGGCTTGGT TTGGGCCGAA 
181 GAAAATGGAG AAATGCTTAA GACAGAGGGG TCTCAAGGGA AATCCTTATA AGCTACTCTA 
241 TGGAGATCTA AACGAACTGA CAAAAAGCAT AATAGAAGCC AAGTCTAAGC CCATCAATTT 
301 CTCTGATGAT ATTGCTCAAA GGCTCATCCC TTTTTTTCTT GACGCCATCA ACAAAAATGG 
361 TAAAAACTCC TTCGTCTGGC TTGGACCGTA TCCAATAGTG TTGATCACGG ATCCTGAGCA 
421 TTTAAAGGAG ATTTTCACAA AGAATTATGT GTATCAAAAG CAAACTCATC CCAATCCATA 
481 CGCCAAGCTA TTAGCTCACG GTCTTGTCAG CCTTGAGGAA GACAAATGGG CCAAACACAG 
541 AAAAATCATT AGTCCTGCCT TCCATGTCGA GAAGCTAAAG CATATGCTGC CTGCATTTTA 
601 TCTGAGTTGT AGTGAAATGA TAAGCAAATG GGAGGAGGTT GTTCCAAAAG AAACATCATT 
661 CGAGCTCGAT GTATGGCCAG ACCTTCAAAT AATGACCAGT GAAGTCATTT CTCGCACTGC 
721 ATTTGGGAGT AGCTATGAAG AAGGAAGAAT AGTATTTGAA CTTCAGAAAG AACAAGCTGA 
781 GTATGTAATG GACATAGGAC GTTCAATTTA TATACCAGGA TCAAGGTTCT TGCCTACTAA 
841 AAGGAACAAA AGAATGCTGG AAATTGAAAA GCAAGTGCAA ACAACAATTA GGCGTATCAT 
901 CGACAAAAGA TTGAAGGCAA TGGAAGAAGG GGAGACTAGT AAAGATGACT TAT TAG G CAT 
961 ATTACTTGAA TCCAATTTGA AAGAAATTGA ACTTCATGGA AGAAATGACT TGGGAATAAC 
1021 AACGTCAGAA GTGATTGAAG AGTGCAAGTT ATTCTATTTT GCCGGCCAAG AGACCACTTC 
1081 AGTGTTGCTT GTTTGGACAA TGATTTTGTT GTGCTTACAT CCAGAGTGGC AAGTACGTGC 
1141 CAGAAAGGAA GTGTTGCAGA TCTTTGGAAA TGATAAACCA GATTTGGAAG GACTAAGTCG 
1201 CTTGAAAATT GTAACAATGA TCTTGTACGA GACGTTACGC CTATTCCCCC CATTACCAGC 
1261 ATTTGGTAGA AGGAACAAAG AAGAAGTCAA ATTAGGGGAG CTACATCTAC CGGCTGGAGT 
1321 GTTACTCGTT ATACCAGCAA TCTTAGTACA TTATGATAAG GAAATATGGG GTGAAGATGC 
1381 AAAGGAATTC AAACCAGAAA GATTCAGTGA AGGAGTGTCA AAGGCAACAA ATGGACAAGT 
1441 CTCATTTATA CCATTTAGCT GGGGACCTCG TGTTTGCATT GGACAAAACT TCGCAATGAT 
1501 GGAAGCAAAA ATGGCAGTAA CTATGATACT ACAAAAATTC TCCTTTGAAC TATCCCCTTC 
1561 TTATACACAT GCTCCATTTG CAATTGTGAC TATTCATCCC CAGTATGGTG CTCCTCTGCT 
1621 TATGCGCAGA CTTTAAAACA TATGTTGCTG ATATTTAAGA TCAGTGGCGT TTTATT 

SEQ. ID. NO. 278 

1 MEMMYSIIAA ASIAIILVYT WKVLNWAWFG PKKMEKCLRQ RGLKGNPYKL LYGDLNELTK 
61 SIIEAKSKPI NFSDDIAQRL IPFFLDAINK NGKNSFVWLG PYPIVLITDP EHLKEIFTKN 
121 YVYQKQTHPN PYAKLLAHGL VSLEEDKWAK HRKIISPAFH VEKLKHMLPA FYLSCSEMIS 
181 KWEEWPKET SFELDVWPDL QIMTSEVISR TAFGSSYEEG RIVFELQKEQ AEYVMDIGRS 
241 IYIPGSRFLP TKRNKRMLEI EKQVQTTIRR IIDKRLKAME EGETSKDDLL GILLESNLKE 
301 IELHGRNDLG ITTSEVIEEC KLFYFAGQET TSVLLVWTMI LLCLHPEWOV RARKEVLQIF 
361 GNDKPDLEGL SRLKIVTMIL YETLRLFPPL PAFGRRNKEE VKLGELHLPA GVLLVIPAIL 
421 VHYDKEIWGE DAKEFKPERF SEGVSKATNG QVSFIPFSWG PRVCIGQNFA MMEAKMAVTM 
481 ILQKFSFELS PSYTHAPFAI VTIHPQYGAP LLMRRL 
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NAME D101-BA2 

ORGANISM N I COT I ANA TABACUM 

SEQ. ID. NO. 279 



1 CTAAATTTCA TATACCTTTA GTACTCTTGA AATTTTCAAA TAATGGTTTA TCTTCTTTCT 
61 CCCATAGAAG CCATTGTAGG ATTTGTAACC TTTTCATTTC TATTCTACTT TCTATGGACC 
121 AAAAAACAAT CAAAAATCTT AAACCCACTA CCTCCAAAAA TCCCAGGTGG ATGGCGAGTA 
181 ATCGGCCATC TCTTTTATTT CAAGAACAAT GGCGATGAAG ATCGCCATTT TTCTCAAAAA 
241 CTCGGTGACT TAGCTGACAA ATATGGTCCC GTCTTCACTT TCCGGTTAGG GTTTCGCCGT 
301 TTCTTGGCGG TGAGTAGTTA TGAAGCTATG AAAGAATGCT TCACTACCAA TGATATCCAT 
361 TTCGCCGATC GGCGATCTTT ACTCTACGGA GAATACCTTT GCTATAATAA TGCCATGCTT 
421 GCTGTTGCCA AATATGGCCC TTACTGGAAA AAAAATCGAA AGTTAGTCAA TCAAGAAGTT 
481 CTCTCCGTTA GTCGGCTCGA AAAATTCAAA CATGTTAGAT TTTCTATAAT TCAGAAAAAT 
541 ATTAAACAAT TGTATAATTG TGATTCACCA ATGGTGAAGA TAAACCTTAG TGATTGGATA 
601 GATAAATTGA CATTCGACAT CATTTTGAAA ATGGTTGTTG GGAAGAACTA TAATAATGGA 
661 CATGGAGAAA TACTCAAAGT TGCTTTTCAG AAATTCATGG TTCAAGCTAT GGAGATGGAG 
721 CTCTATGATG TTTTTCACAT TCCATTTTTC AAGTGGTTGG ATCTTACAGG GAATATTAAG 
781 GCTATGAAAC AAACTTTCAA AGACATTGAT AATATTATCC AAGGTTGGTT AGATGAGCAC 
841 ATTAAGAAGA GAGAAACAAA GGATGTTGGA GGTGAAAACG AACAAGATTT TATAGATGTG 
901 GTGCTTTCCA AGATGAGCGA CGAACATCTT GGCGAGGGTT ACTCTCATGA CACAACCATC 
961 AAAGCAACTG TATTCACTTT GGTCTTGGAT GCAACAGACA CACTTGCACT TCATATAAAG 
1021 TGGGTAATGG C GTTAAT GAT AAACAATAAG CATGTCATGA AGAAAGCACA AGAAGAGATG 
1081 GACACAATTG TTGGTAGAGA TAGATGGGTA GAAGAGAGTG ATATCAAGAA TTTGGTGTAT 
1141 CTCCAAGCAA TTGTTAAAGA AGTATTACGA TTACATCCAC CTGCACCTTT GTCAGTGCAA 
1201 CACCTATCTG TGGAAGATTG TGTTGTCAAT GGGTACGATA TTCCTAAGGG GACTGCACTA 
1261 CTTACCAATA TTATGAAACT ACAGCGAGAT CCTCAAACAT GGCCAAATCC TGATAAATTC 
1321 GATCCAGAGA GATTCCTGAC GACTCATGCT ACTATTGACT ACCGCGGGCA GCACTATGAG 
1381 TTGATCCCCT TTGGTACGGG GAGACGAGCT TGTCCCGCGA TGAATTATTC ATTGCAAGTG 
1441 GAACACCTTT CAATTGCTCA TATGATCCAA GGTTTCAGTT TTGCAACTAC GACCAATGAG 
1501 CCTTTGGATA TGAAACAAGG TGTGGGTTTA ACTTTACCAA AGAAGACTGA TGTTGAAGTT 
1561 CTAATTACCC CTCGTTT 

SEQ. ID. NO. 280 

1 MVYLLSPIEA IVGFVTFSFL FYFLWTKKQS KILNPLPPKI PGGWPVIGHL FYFKNNGDED 
61 RHFSQKLGDL ADKYGPVFTF RLGFRRFLAV SSYEAMKECF TTNDIHFADR PSLLYGEYLC 
121 YNNAMLAVAK YGPYWKKNRK LVNQEVLSVS RLEKFKHVRF SIIQKNIKQL YNCDSPMVKI 
181 NLSDWIDKLT FDIILKMWG KNYNNGHGEI LKVAFQKFMV QAMEMELYDV FHIPFFKWLD 
241 LTGNIKAMKQ TFKDIDNIIQ GWLDEHIKKR ETKDVGGENE QDFIDWLSK MSDEHLGEGY 
301 SHDTTIKATV FTLVLDATDT LALHIKWVMA LMINNKHVMK KAQEEMDTIV GRDRWVEESD 
361 I KNLVYLQAI VKEVLRLHPP APLSVQHLSV EDCWNGYHI PKGTALLTNI MKLQRDPQTW 
421 PNPDKFDPER FLTTHATIDY RGQHYELIPF GTGRRACPAM NYSLQVEHLS IAHMIQGFSF 
481 ATTTNEPLDM KQGVGLTLPK KTDVEVLITP R 



FIG. 141 



NAME D130-AA1 
ORGANISM NI COT I ANA TABACUM 
SEQ. ID. NO. 281 

1 CTTTTTCTCC CCAAAAAAGA GCTCATTTCC CTTGTCCCCA AAAATGGATC TTCTCTTACT 
61 AGAGAAGACC TTAATTGGTC TCTTCTTTGC CATTTTAATC GCTGTAATTG TCTCTAGACT 
121 TCGTTCAAAG CGTTTTAAGC TTCCCCCAGG ACCAATCCCA GTACCAGTTT TTGGTAATTG 
181 GCTTCAAGTT GGTGATGATT TAAACCACAG AAATCTTACT GATTTTGCCA AAAAATTTGG 
241 TGATCTTTTC TTGTTAAGAA TGGGCCAGCG TAATTTAGTT GTTGTGTCAT CTCCTGAATT 
301 AGCTAAAGAA GTTTTACACA CACAAGGTGT TGAATTTGGT TCAAGAACAA GAAATGTTGT 
361 ATTTGATATT TTTACTGGAA AAGGT CAAGA TATGGTTTTT ACTGTATATG GTGAACACTG 
421 GAGAAAAATG AGGAGAATTA TGACTGTACC ATTTTTTACT AATAAAGTTG TGCAGCAATA 
481 TAGAGGGGGG TGGGAGTTTG AAGTGGCAAG TGTAATTGAG GATGTGAAGA AAAATCCTGA 
541 ATCTGCTACT AATGGGATTG TATNAAGGAG GAGATTACAA TTGATGATGT ATAATAATAT 
601 GTTTAGGATT ATGTTTGATA GGAGATTTGA GAGTGAAGAT GATCCTTTGT TTGTTAAGCT 
661 TAAGGCTTTG AATGGTGAAA GGAGTAGATT GGCTCAGAGT TTTGAGTATA ATTATGGTGA 
721 TTTTATTCCC ATTTTGAGGC CTTTTTTGAG AGGTTATTTG AAGATCTGTA AAGAAGTTAA 
781 GGAGAAGAGG CTGCAGCTTT TCAAAGATTA CTTTGTTGAT GAAAGAAAGA AGCTTTCAAA 
841 TACCAAGAGC TTGGACAGCA ATGCTCTGAA ATGTGCGATT GATCACATTC TTGAGGCTCA 
901 ACAGAAGGGG GAGATCAATG AGGACAACGT TCTTTACATT GTTGAAAACA TCAATGTTGC 
961 TGCTATAGAA ACCACATTAT GGTCAATTGA GTGGGGTATC GCCGAGTTAG TCAACCACCC 
1021 TCACATCCAA AAGAAACTCC GCGACGAGAT TGACACAGTT CTTGGCCCAG GAGTGCAAGT 
1081 GACTGAACCA GACACCCACA AGCTTCCATA CCTTCAGGCT GTGATCAAGG AGACGCTTCG 
1141 TCTCCGTATG GCAATTCCTC TATTAGTCCC ACACATGAAC CTTCACGATG CAAAGCTTGG 
1201 CGGGTTTGAT ATTCCAGCAG AGAGCAAAAT CTTGGTTAAC GCTTGGTGGC TAGCTAACAA 
1261 CCCGGCTCAT TGGAAGAAAC CCGAAGAGTT CAGACCCGAG AGGTTCTTCG AAGAGGAGAA 
1321 GCACGTTGAG GCCAATGGCA ATGACTTCAG ATATCTTCCG TTTGGCGTTG GTAGGAGGAG 
1381 TTGCCCTGGA ACTATACTTG CATTGCCAAT TCTTGGCATT ACTTTGGGAC GTTT 

SEQ. ID. NO. 282 

1 MDLLLLEKTL IGLFFAILIA VIVSRLRSKR FKLPPGPIPV PVFGNWLQVG DDLNHRNLTD 
61 FAKKFGDLFL LRMGQRNLW VSSPELAKEV LHTQGVEFGS RTRNWFDIF TGKGQDMVFT 
121 VYGEHWRKMR RIMTVPFFTN KWQQYRGGW EFEYASVIED VKKNPESATN GIVLRRRLQL 
181 MMYNNMFRIM FDRRFESEDD PLFVKLKALN GERSRLAQSF EYNYGDFIPI LRPFLRGYLK 
241 ICKEVKEKRL QLFKDYFVDE RKKLSNTKSL DSNALKCAID HILEAQQKGE INEDNVLYIV 
301 ENINVAAIET TLWSIEWGIA ELVNHPHIQK KLRDEIDTVL GPGVQVTEPD THKLPYLQAV 
361 IKETLRLRMA IPLLVPHMNL HDAKLGGFDI PAESKILVNA WWLANNPAHW KKPEEFRPER 
421 FFEEEKHVEA NGNDFRYLPF GVGRRSCPGT ILALPILGIT LGR 
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NAME D136-AD5 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 283 

1 CCAAATTAGA GCAAGAAATT AACAAGTCTA GTTACCTTCT CCCTTTTTAA GAGTATTTAA 
61 GATTTAAGAT TTAAGATGAA GCAACTGAGG TAAGTCCTTT CAAGGAGTAG TTGTCACTTC 
121 TGAGAATGGA GAT GAT GT AC AGCATAATAG CAGCAGCCAG TATTGCAATT ATCTTGGTAT 
181 ATACATGGAA AGTGTTGAAT TGGGCTTGGT TTGGGCCAAA GAAAATGGAG AAATGCTTAA 
241 GACAGAGGGG TCTCAAGGGA AATCCTTATA AGCTACTCTA TGGAGATCTA AACGAACTGA 
301 CAAAAAGCAT AATAGAAGCC AAGTCTAAGC CCATCAATTT CTCTGATGAT ATTGCTCAAA 
361 GGCTCATCCC TTTTTTTCTT GACGCCATCA ACAAAAATGG TAAAAACTCC TTCGTCTGGC 
421 TTGGACCGTA TCCAATAGTG TTGATCACGG ATCCTGAGCA TTTAAAGGAG ATTTTCACAA 
481 AGAATTATGT GTATCAAAAG CAAACTCATC CCAATCCATA CGCCAAGCTA TTAGCTCACG 
541 GTCTTGTCAG CCTTGAGGAA GACAAATGGG CCAAACACAG AAAAATCATT AGTCCTGCCT 
601 TCCATGTCGA GAAGCTAAAG CATATGCTGC CTGCATTTTA TCTGAGTTGT AGTGAAATGA 
661 TAAGCAAATG GGAGGAGGTT GTTCCAAAAG AAACATCATT CGAGCTCGAT GTATGGCCAG 
721 ACCTTCAAAT AATGACCAGT GAAGTCATTT CTCGCACTGC ATTTGGGAGT AGCTAT GAAG 
781 AAGGAAGAAT AGTATTTGAA CTTCAGAAAG AACAAGCTGA GTATGTAATG GACATAGGAC 
841 GTTCAATTTA TATACCAGGA TCAAGGTTCT TGCCTACTAA AAGGAACAAA AGAATGCTGG 
901 AAATTGAAAA GCAAGTGCAA ACAACAATTA GGCGTATCAT CGACAAAAGA TTGAAGGCAA 
961 TGGAAGAAGG GGAGACTAGT AAAGAT GACT TATTAGGCAT ATTACTTGAA TCCAATTTGA 
1021 AAGAAATTGA ACTTCATGGA AGAAATGACT TGGGAATAAC AACATCAGAA GT GATT GAAG 
1081 AGTGCAAGTT AATCTATTTT GCCGGCCAAG AGACCACTTC AGTGTTGCTT GTTTGGACAA 
1141 TGATTTTGTT GTGCTTACAT CCAGAGTGGC AAGTACGTGC CAGAAAGGAA GTGTTGCAGA 
1201 CCTTTGGAAA TGATAAACCA GATTTGGAAG GACTAAGTCG CTTGAAAATT GTAACAATGA 
1261 TCTTGTACGA GACGTTACGC CTATTCCCCC CATTACCAGC ATTTGGTAGA AGGAACAAAG 
1321 AAGAAGTCAA ATTAGGGGAG CTACATCTAC CGGCTGGAGT GTTACTCGTT ATACCAGCAA 
1381 TCTTAGTACA TTATGATAAG GAAATATGGG GTGAAGATGC AAAGGAATTC AAACCAGAAA 
1441 GATTCAGTGA AGGAGTGTCA AAGGCAACAA ATGGACAAGT CTCATTTATA CCATTTAGCT 
1501 AGGGACCTCG TGTTTGCATT GGACAAAACT TCGCAATGAT GGAAGCAAAA ATGGCAGTAA 
1561 CT AT GAT ACT AC AAAAAT T C TCCTTTGAAC TATCCCCTTC TTATACACAT GCTCCATTTG 
1621 CAATTGTGAC TATTCATCCC CAGTATGGTG CTCCTCTGCT TATGCGCAGA CTTTAAAACA 
1681 TATGTTGCTG ATATTTAAGA TCAGTGGCGT TTTATTCTCC ATG 

SEQ. ID. NO. 284 

1 MEMMYSIIAA ASIAIILVYT WKVLNWAWFG PKKMEKCLRQ RGLKGNPYKL LYGDLNELTK 
61 SIIEAKSKPI NFSDDIAQRL IPFFLDAINK NGKNSFVWLG PYPIVLITDP EHLKEIFTKN 
121 YVYQKQTHPN PYAKLLAHGL VSLEEDKWAK HRKIISPAFH VEKLKHMLPA FYLSCSEMIS 
181 KWEEWPKET SFELDVWPDL QIMTSEVISR TAFGSSYEEG RIVFELQKEQ AEYVMDIGRS 
241 IYIPGSRFLP TKRNKRMLEI EKQVQTTIRR I I DKRLKAME EGETSKDDLL GILLESNLKE 
301 IELHGRNDLG ITTSEVIEEC KLIYFAGQET TSVLLWTMI LLCLHPEWQV RARKEVLQTF 
361 GNDKPDLEGL SRLKIVTMIL YETLRLFPPL PAFGRRNKEE VKLGELHLPA GVLLVIPAIL 
421 VHYDKEIWGE DAKEFKPERF SEGVSKATNG QVSFIPFS 
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NAME D138-AD12 
ORGANISM N I COT I AN A TABACUM 
SEQ. ID. NO. 285 

1 TTTGCCTTTG CTCGTCATTG ATGACGACTT CATTTTGTTT TCTTCCCCAC GAAAATGGTA 
61 GATATGATAT GGAGGGACGT AGGGAAGAAT TACTGGGACA AACCTAGTGA GTGAAAATGG 
121 AAACAGT T GA AATGATAGTA AAAGTATCTT GTGCTGCCAT AGTAATTACT CTGTTGGTGT 
181 GTCTATGGAG AGTGCTGAAT TGGGTTTGGT TCAGACCAAA GAAATTAGAG AAGTTGTTGA 
241 GAAAACAGGT TTTGTATGGG GACATGAAAG AGTTTTCTGG GATGATTAAG GAAGCATACT 
301 CAAAGCCTAT GAGTCTGTCT GATGATGTAG CACCACGAAT GATGCCTTTC TTTCTTGAAA 
361 CCATCAAGAA ATATGGAAAA AGATCCTTTA TATGGTTCGG TCCAAGACCA CTAGTATTGA 
421 TCATGGATCC TGAGCTTATA AAGGAAGTAC TCTCCAAAAT CTATCTTTAT CAAAAGCCCG 
4 81 GTGGAAATCC ATTAGCAACA CTATTGGTAC AAGGATTAGC AACCTATGAG GAAGACAAAT 
541 GGGCCAAACA TAGAAAAATC ATCAATCCCG CTTTCCATCT AGAGAAGCTA AAGCATATGC 
601 TTCCAGCTTT TCGCTTGAGC TGTAGTGAGA TGCTGAGCAA ATGGGAAGAC ATTGTTTCAG 
661 CTGAAGGCTC ACATGAGATA GATGTATGGC CTAACCTTGA GCAATTGAGT TGCGATGTGA 
721 TCTCTCGGAC AGCTTTTGGC AATAGTTATG AAGAAGGTAG AAAGATTTTT GAACTTCAAA 
781 AGGAACAAAC TCAGCATCTT GTGGAAGCTT TCCGCTCTGT TTATATCCCA GGAAGGAGAT 
841 TTTTGCCAAC AAAGAGGAAT AGAAGAATGA AGGAAATAAA AAAGGAGGTT CGAGCGTCAA 
901 TTAAAGGTAT TAT T GAT AAA AGATTGAAGG CAATGAAAGC AGGGGACACC AATAATGAGG 
961 ATCTATTGGG ATATTGCTGG AATCAAATTT TAAAGAAATT GAACAGCGCG GAAACAAGGA 
1021 TTTTGGAATG AGCATTGAAG ATGTCATTGA AGAATGCAAG TTATTCTATT TTGCTGGCCA 
1081 AGAAACTACA TCAGTGTTGC TCCTATGGTC TCTAGTGTCG TTGAGCAGGT ATCAAGATTG 
1141 GCAGACACGG GCCAGAGAAG AAGTCTTGCA TGTCTTTGGG AGTCGGAAAC CAGATTTTGA 
1201 TGAATTAAAT CATCTAAAAG TTGTGACAAT GATCATGTAC GAGTCTTTAA GGCTATATCC 
1261 CTCACTAATA ACACTTACCC GCCGGTGTAA TGAAGACATT GTATTAGGAG AACTATCTCT 
1321 ACCAGCTGGT GTCCTAGTCT CTTTGCCAAT GATTTTGTTG CATCATGATG AAGAGATATG 
1381 GGGTGAAGAT GCAAAGGAGT TCAAACCAGA GAGATTTAGA GAAGGATTGT CAAGTGCAAC 
1441 AAAGGGTCAA CTTACATATT TTCCATTTGG CTGGGGTCCT AGAATATGTA TTGGACAAAA 
1501 TTTTGCCATG TTAGAAGCAA AGATGGCTCT GTCTATGATC CTGCAACGCT TCTCTTTTGA 
1561 ACTGTCTCCG TCTTATGCAC ATGCCCCTCA GTCCATATTA ACCGTTCAGC CTCAATATGG 
1621 TGCTCCACTT ATTTTCCACA AGCTATAATT TGGTACTTGT GAAAGGTGTC TTGTACAATA 
1681 TGTTAGTAGA GTTTATTCAG ACTTAGATAC ATGCTTC 

SEQ. ID. NO. 286 

1 METVEMIVKV SCAAIVITLL VCLWRVLNWV WFRPKKLEKL LRKQVLYGDM KEFSGMIKEA 

61 YSKPMSLSDD VAPRMMPFFL ETIKKYGKRS FIWFGPRPLV LIMDPELIKE VLSKIYLYQK 

121 PGGNPLATLL VQGLATYEED KWAKHRKIIN PAFHLEKLKH MLPAFRLSCS EMLSKWEDIV 

181 SAEGSHEIDV WPNLEQLSCD VISRTAFGNS YEEGRKIFEL QKEQTQHLVE AFRSVYIPGR 

241 RFLPTKRNRR MKEIKKEVRA SIKGIIDKRL KAMKAGDTNN EDLLGYCWNQ ILKKLNSAET 

301 RILE 
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NAME D216-AG8 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 287 

1 CCAAAATGCA GTTCTTCAAC TTCATTTCCT TTGTCTTTTT TGTGTCTTTC CTCTTTTTAT 
61 TAAGGAAATG GAAGAACTCC AATAGCCAAA CCAAAAGATT GCCTCCAGGT CCATGGAAAT 
121 TACCTGTACT TGGAAGCATG TTTCATTTGC TAGGTGGACC TCCACATCAT GTCCTTGGAG 
181 ATTTAGCCAA AAAATATGGT CCACTTATGC ACCTTCAACT AGGTGAAGTT TCTGTAGTTT 
241 CTGTTACTTC TCCTGAGATG GCAAAAGAAG TACTAAAAAC TCATGACCTC GCTTTTGCAT 
301 CTAGGCCGTT ACTTTTGGCA GCCAAAATTG TCTGCTATAA TGGGACAGAC ATTGTCTTTT 
361 CCCCCTATGG CGATTATTGG AGACAAACGC GTAAAATTTG TCTCTTGGAA TTGCTCAGTG 
421 CCAAAAATGT TAGGTCATTC AGCTCAGTCA GACGAGATGA AGTTTTCCAT ATGATTGAAT 
481 TTTTTTCGAT CATCTTCTGG TAAGCCAGTT AATGTATCAA AAAGGATTTC TCTATTCACA 
541 ACCTCTATGA CATGTAGATC AGCCTTTGGA CAAGAATACA AGGAGCAAGA CGAATTCGCA 
601 CAACTAGTAA AAAAAGTGTC AAGCTTAATG GAAGGGTTTG ATGTTGCTGA TATATTCCCT 
661 TCATTGAAGT TTCTTCATGT GCTCAGTGGA ATGAAGGCTA AAGTTATGGA TGCACACCAT 
721 GAGTTAGATG CCATTCTTGA AAAAATTATC AAT GAGCACA AGAAAATTGC AACTGGAAAG 
781 AAT AAT AAT G AATTAGGAGG TGAAGGATTA ATTGACGTAC TGCTAAGACT TATGAAAGAG 
841 GGAGGCCTTC AATTCCCGAT CACCAACGAC AACATCAAAG CTATTATTTT TGACATGTTT 
901 GGTGCGGGAA CGGAAACTTC ATCAACCACA ATTGACTGGG CCATGGTCGA AATGATAAAG 
961 AATCCAAGTG TATTCGCTAA AGCTCAAGCA GAGGTAAGAG AAGCCTTCAG AGAGAAAGAA 
1021 ACTTTTGATG AAAATGATGT CGAGGAGTTG AAATACTTAA AATTGGTTAT CAAAGAAACT 
1081 TTCAGACTCC ATCCTCCATT TCCCCTTTTG CTCCCAAGAG AATCTAGAGA AGAAACAGAT 
1141 ATAAACGGCT ACACTATTCC TTTTAAAACA AAACTTATGG TTAACGTTCG GGCTATTGGA 
1201 AGAGATCCAA AATATTGGGA TGACGTGGAA AGTTTTAAGC CAGAGAGATT TGAGCACAAC 
1261 TCTATGGATT TTATTGGTAA TAATTTTGAA TATCTTCCCT TTGGTAGTGG AAGGAGAATG 
1321 TGCCCTGGGA TATCATTTGG TTTGGCTAAT GTTTATTTGC CACTAGCTCA ATTGTTATAT 
1381 CATTTTGATT GGAAACTCCC TACTGGAATC AATTCAAGTG ACTTGGACAT GACTGAGTCG 
1441 TCAGGAGTAA CTTGTGCTAG AAAGAGTGAT TTATACTTGA CTGCTACTCC ATATCAACTT 
1501 TCTCAAGAGT GATGCAATGA TATCAACCTT TTGAATTTCG GTCAACCCCA CCAATAGTG 

SEQ. ID. NO. 288 

1 MQFFNFISFV FFVS FLFLLR KWKNSNSQTK RLPPGPWKLP VLGSMFHLLG GPPHHVLGDL 
61 AKKYGPLMHL QLGEVSWSV TSPEMAKEVL KTHDLAFASR PLLLAAKIVC YNGTDIVFSP 
121 YGDYWRQTRK ICLLELLSAK NVRSFSSVRR DEVFHMIEFF SIIFW 



FIG. 145 



NAME D243-AB3 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 289 

1 CCCCACCAAA AAATCATTTC TCTCGTCTAA AATGGATCTT CTCTTACTAG AGAAGACCTT 
61 AATTGGTCTT TTCTTTGCCA TTTTAATCGC TTTAATTGTC TCTAAACTTC GTTCAAAGCG 
121 TTTTAAGCTT CCTCCAGGAC CAATTCCAGT ACCAGTTTTT GGTAATTGGC TTCAAGTTGG 
181 TGATGATTTA AACCACAGAA ATCTTACTGA TTATGCCAAG AAATTTGGAG ATCTTTTCTT 
241 GTTAAGAATG GGTCAACGTA ACTTAGTTGT TGTGTCATCT CCTGAATTAG CTAAAGAAGT 
301 TTTACACACA CAAGGTGTTG AATTTGGTTC AAGAACAAGA AATGTTGTGT TTGATATTTT 
361 TACTGGAAAA GGTCAAGATA TGGTTTTTAC TGTATATGGT GAACATTGGA GAAAAATGAG 
421 GAGAATTATG ACTGTACCAT TTTTTACTAA TAAAGTTGTG CAACAGTATA GAGGGGGGTG 
481 GGAGTTTGAG GTGGCAAGTG TAATTGAGGA TGTGAAAAAA AATCCTGAAT CTGCTACTAA 
541 TGGGATCGTA TTAAGGAGGA GATTACAATT AATGATGTAT AATAATATGT TTAGGATTAT 
601 GTTTGATAGG AGATTTGAGA GTGAAGATGA TCCTTTGTTT GTTAAGCTTA AGGCTTTGAA 
661 TGGTGAAAGG AGTAGATTGG CTCAAAGTTT TGAGTATAAT TATGGTGATT TTATTCCAAT 
721 TTTGAGGCCT TTTTTTGAGA GGTTATTTGA AGATCTGTAA AGAAGTTAAG GAGAAGAGGC 
781 TGCAGCTTTT CAAAGATTAC TTTGTTGATG AAAGAAAGAA GCTTTCGAAT ACCAAGAGCT 
841 CGGACAGCAA TGCCCTAAAA TGTGCGATTG ATCACATTCT TGAGGCTCAA CAGAAGGGAG 
901 AGATCAATGA GGACAACGTT CTTTACATTG TTGAAAACAT CAATGTTGCT GCAATTGAAA 
961 CAACATTATG GTCAATTGAG TGGGGTATCG CCGAGCTAGT CAACCACCCT CACATCCAAA 
1021 AGAAACTGCG CGACGAGATT GACACAGTTC TTGGACCAGG AGTGCAAGTG ACTGAACCAG 
1081 ACACCCACAA GCTTCCATAC CTTCAGGCTG TGATCAAGGA GGCACTTCGT CTCCGTATGG 
1141 CAATTCCTCT ATTAGTCCCA CACATGAACC TTCACGACGC AAAGCTTGGC GGGTTTGATA 
1201 TTCCAGCAGA GAGCAAAATC TTGGTTAACG CTTGGTGGTT AGCTAACAAC CCGGCTCATT 
1261 GGAAGAAACC CGAAGAGTTC AGACCCGAGA GGTTCTTTGA AGAGGAGAAG CATGTTGAGG 
1321 CCAATGGCAA TGACTTCAGA TATCTTCCGT TTGGCGTTGG TAGGAGGAGC TGCCCTGGAA 
1381 TTATACTTGC ATTGCCAACT CTTGGCATCA CTTTGGGACG TTTGGTTCAG AACTTTGAGC 
1441 TGTTGCCTCC TCCAGGCCAG TCGAAGCTCG ACACCACAGA GAAAGGTGGA CAGTTCAGTC 
1501 TCCACATTTT GAAGCATTCC ACCATTGTGT TGAAACCAAG GTCTTTCTGA ACTTTGTGAT 
1561 CTTATTAATT AAGGGGTTCT GAAGAAATTT GATAGTGTTG G 

SEQ. ID. NO. 290 

1 MDLLLLEKTL IGLFFAILIA LIVSKLRSKR FKLPPGPIPV PVFGNWLQVG DDLNHRNLTD 
61 YAKKFGDLFL LRMGQRNLW VSSPELAKEV LHTQGVEFGS RTRNWFDIF TGKGQDMVFT 
121 VYGEHWRKMR RIMTVPFFTN KWQQYRGGW EFEVASVIED VKKNPESATN GIVLRRRLQL 
181 MMYNNMFRIM FDRRFESEDD PLFVKLKALN GERSRLAQSF EYNYGDFIPI LRPFFERLFE 
241 DL 
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NAME D250-AC11 
ORGANISM N I COT I ANA 
SEQ. ID. NO. 291 

1 ATAATGCTCT 
61 TTCAAAAATG 
121 AATTTGCATC 
181 TATGGCAAAA 
241 AAATTAGCAA 
301 CTTGGCCAAC 
361 TAT T GGAGAG 
421 TTATTTAGTC 
481 GCTTCTACTT 
541 ATTTGTAGAG 
601 GATTTTCTTT 
661 CCCTTTTTAA 
721 AAGGATTTGG 
781 AAATATATGG 
841 CCACTTGATC 
901 TCAGACACTA 
961 GCCATGGAAA 
1021 GAAGAAGATG 
1081 TATCCACCAG 
1141 TATGAAATTC 
1201 GAAATAT GGG 
1261 TACAAGGGTC 
1321 ATTGCACTTG 
1381 TGGGAGTTGC 
1441 GCCATGCACA 
1501 GGGACGTGGA 

SEQ. ID. NO. 292 

1 MLFLLFVALP 
61 GKIFSLKLAS 
121 WREMRKICVL 
181 CRVAFGVRFE 
241 DLDNFYEELI 
301 DTSAAATVWA 
361 PPAPLLVPRE 
421 KGQDFELLPF 
481 MHKKNELCLV 



TABACUM 



TTCTACTCTT 
GTGGAAATAA 
AATATGATAG 
TCTTCTCATT 
AAGAAGTATT 
AAAAACTGTC 
AAATGAGAAA 
CAATTCGTGA 
CACAAATTAT 
TTGCTTTTGG 
TGGCCGAGGC 
GTTAGATTGA 
ATAATTTTTA 
AAGGAGATAT 
TCACTATGGA 
GTGCAGCTGC 
AAGTTCAATT 
TCCAAAACAT 
CTCCACTTTT 
GGCCAAGAAC 
AAAATCCAGA 
AAGATTTTGA 
GGGTTGCATC 
CTTATGGAGT 
AGAAAAACGA 
TCTCATGCTA 



TGTAGCCCTT 
CAGATTGCCA 
TATAACTCCT 
AAAACTTGCT 
GAAAAAACAA 
TTATTATGGT 
AATTTGTGTT 
AGATGAAGTT 
TAATTTGAGT 
TGTTAGGTTT 
ACAAGAAATG 
CAAATTAAGT 
TGAAGAACTC 
TGTTGATCTT 
AGATATAAAA 
TACTGTTTGG 
AGAAATCAGA 
CCCTTATTTT 
AGTTCCAAGA 
CATAGTTCAT 
TGAATTTATA 
GTTACTTCCA 
CATGGAACTT 
GAAAAAAGAA 
ACTTTGCCTT 
GTTCTGTGCG 



CCTTTCATTC 
CCAGGTCCTA 
CATATCTATT 
TCTACTAATG 
GATTTAATAT 
CGTGATATTG 
CTTCATCTTT 
TTTAGAATGA 
AATTTAATGA 
GAAGAAGAAG 
ATGGCTAGTT 
GGATTGACAT 
ATTGAGCAAC 
TTGCTACAAT 
GGAATTCTCA 
GCAATGACAG 
AAATCAGTTG 
AAAGCAGT GA 
GAATCAATGG 
GTTAACGCTT 
CCTGAGAGAT 
TTTGGTGCAG 
GCTTTGTCAA 
GACATCGACA 
GTCCCAAAAA 
GTCAGCTAAG 



TTATTTTTCT 
TAGGTTTACC 
TTTGGAAACT 
TGGTAGTAGT 
TTTGTAGTAG 
CTTTTGCACC 
TTAGTTTAAA 
TTAAGAAAAT 
TTTCATTAAC 
CACATGCAAG 
TCTTTGTATC 
ATAGACTTGA 
ATCAAAATCC 
TGAAGAAAGA 
TGAATGTGTT 
CCTTGATAAA 
GGAAGAAAGG 
TAAAGGAAAT 
AAAAAACCAT 
GGGCTATAGC 
TTTTGAATAG 
GCAGAAGAGG 
ATCTTCTTTA 
CAAACGTTAG 
AATTATTTAT 
CTTA 



TCTTCCTAAA 
ATTCATTGGA 
TTCCAAAAAA 
TTCTTCAGCA 
ACCATCTATT 
TTATAATGAT 
AAAAGTTCAA 
ATCAAAACAA 
AAGTACAATT 
GAAGAGATTT 
TGATTTTTTT 
GAGGAATTTC 
TAATAAGCCA 
GAAATTAACA 
AGTTGCAGGA 
GAATCCTAAA 
CATTGTAAAT 
ATTTAGATTG 
ATTAGAAGGT 
AAGGGATCCT 
CAGTATCGAT 
TTGCCCAGGT 
TGCATTTGAT 
GCCTGGAATT 
AAATTATATT 



FILIFLLPKF 
TNWWSSAK 
HLFSLKKVQL 
EEAHARKRFD 
EQHQNPNKPK 
MTALIKNPKA 
SMEKTILEGY 
GAGRRGCPGI 
PKKLFINYIG 



KNGGNNRLPP 
LAKEVLKKQD 
FSPIREDEVF 
FLLAEAQEMM 
YMEGDIVDLL 
MEKVQLEIRK 
EIRPRTIVHV 
ALGVASMELA 
TWISC 



GPIGLPFIGN 
LIFCSRPSIL 
RMIKKISKQA 
ASFFVSDFFP 
LQLKKEKLTP 
SVGKKGIVNE 
NAWAIARDPE 
LSNLLYAFDW 



LHQYDSITPH 
GQQKLSYYGR 
STSQIINLSN 
FLS. IDKLSG 
LDLTMEDIKG 
EDVQNIPYFK 
IWENPDEFIP 
ELPYGVKKED 



IYFWKLSKKY 
DIAFAPYNDY 
LMISLTSTII 
LTYRLERNFK 
ILMNVLVAGS 
AVIKEIFRLY 
ERFLNSSIDY 
IDTNVRPGIA 
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NAME D205-AH4 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 293 

1 GTGAGGTTTG AATCCTCTGC CTCAATGAAA CTCACCAAAT TGGTTTTCTA ATTTCCATCT 
61 AAAATATTGT CCAAAGCTAA AGATTCTTTC TCCTTAAATA GTCAACTTTA GTGGTTCCTC 
121 TTCATTTCAT AGCTCAATCT TTCTTATTTT GATTCAACCA TGGAGAACCA ATACTCCTAC 
181 TCATTCTCTT CCTACTTCTA CTTAGCTATA GTACTGTTTC TTCTTCCAAT TTTGGTCAAA 
241 TATTTCTTCC ATCGGAGAAG AAATTTACCT CCAAGTCCAT TTTCTCTTCC AATAATTGGT 
301 CACCTTTACC TTCTCAAGAA AACTCTCCAT CTCACTCTAA CATCCTTATC AGCTAAATAT 
361 GGTCCTGTTT TATACCTCAA ATTGGGCTCT ATGCCTGTGA TTGTTGTGTC CTCACCATCT 
421 GCTGTTGAAG AATGTTTAAC CAAGAATGAT AT CAT ATT CG CAAATAGGCC CAAGACCGTG 
481 GCTGGTGACA AGTTTACCTA CAATTATACT GTTTATGTTT GGGCACCCTA TGGCCAACTT 
541 TGGAGAATTC TTCGCCGATT AACTGTCGTT GAACTCTTCT CTTCACATAG CCTACAGAAA 
601 ACTTCTATCC TTAGAGATCA AGAAGTTGCA ATATTTATCC GTTCGTTATA CAAATTCTCA 
661 AAGGATAGTA GCAAAAAAGT CGATTTGACC AACTGGTCTT TTACTTTGGT TTTCAATCTT 
721 ATGACCAAAA TTATTGCTGG GAGACATATT GTGAAGGAGG AAGATGCTGG CAAGGAAAAG 
781 GGCATTGAAA TTATTGAAAA ACTTAGAGGG ACTTTCTTAG TAACTACATC ATTCTTGAAT 
841 ATGTGTGATT TCTTGCCAGT ATTCAGGTGG GTTGGTTACA AAGGGCTGGA GAAGAAGATG 
901 GCCTCAATTC ACAATAGAAG AAATGAATTC TTGAACAGCT TGCTTGATGA ATTTCGACAC 
961 AAGAAAAGTA GTGCTTCACA ATCTAACACA ACTGTTGGAA ACATGGAGAA GAAAACCACA 
1021 CTGATTGAAA AGCTCTTGTC TCTTCAAGAA TCAGAGCCTG AATTCTACAC TGATGATATC 
1081 ATCAAAAGTA TTATGCTGGT AGTTTTTGTT GCAGGAACAG AGACCTCATC AACAACCATC 
1141 CAATGGGTAA TGAGGCTTCT TGTAGCTCAC CCTGAGGCAT TGTATAAGCT ACGAGCTGAC 
1201 ATTGACAGTA AAGTTGGGAA TAAGCGCTTG CTGAATGAAT CAGACCTCAA CAAGCTTCCG 
1261 TATTTGCATT GTGTTGTTAA TGAGACAATG AGATTATACA CTCCGATACC ACTTTTATTG 
1321 CCTCATTATT CAACTAAAGA TTGTATTGTG GAAGGATATG ATGTACCAAA ACATACAATG 
1381 TTGTTTGTCA ACGCTTGGGC CATTCACAGG GATCCCAAGG TATGGGAGGA GCCTGACAAG 
1441 TTCAAGCCAG AGAGATTTGA GGCAACAGAA GGGGAAACAG AAAGGTTCAA TTACAAGCTT 
1501 GTACCATTTG GAATGGGGAG AAGAGCGTGC CCTGGAGCTG ATATGGGGTT GCGAGCAGTT 
1561 TCTTTGGCAT TAGGTGCACT TATTCAATGC TTTGACTGGC AAATTGAGGA AGCGGAAAGC 
1621 TTGGAGGAAA GCTATAATTC TAGAATGACT ATGCAGAACA AGCCTTTGAA GGTTGTCTGC 
1681 ACTCCACGCG AAGATCTTGG CCAGCTTCTA TCCCAACTCT AAGGCAATTT ATCAATGCCA 
1741 AACGTAATCT TCATCTACCA CTATG 

SEQ. ID. NO. 294 

1 MENQYSYSFS SYFYLAIVLF LLPILVKYFF HRRRNLPPSP FSLPIIGHLY LLKKTLHLTL 
61 TSLSAKYGPV LYLKLGSMPV IWSSPSAVE ECLTKNDIIF ANRPKTVAGD KFTYNYTVYV 
121 WAPYGQLWRI LRRLTWELF SSHSLQKTSI LRDQEVAIFI RSLYKFSKDS SKKVDLTNWS 
181 FTLVFNLMTK IIAGRHIVKE EDAGKEKGIE IIEKLRGTFL VTTSFLNMCD FLPVFRWVGY 
241 KGLEKKMASI HNRRNEFLNS LLDEFRHKKS SASQSNTTVG NMEKKTTLIE KLLSLQESEP 
301 EFYTDDIIKS IMLWFVAGT ETSSTTIQWV MRLLVAHPEA LYKLRADIDS KVGNKRLLNE 
361 SDLNKLPYLH CWNETMRLY TPIPLLLPHY STKDCIVEGY DVPKHTMLFV NAWAIHRDPK 
421 VWEEPDKFKP ERFEATEGET ERFNYKLVPF GMGRRACPGA DMGLRAVSLA LGALIQCFDW 
4 81 QIEEAESLEE SYNSRMTMQN KPLKWCTPR EDLGQLLSQL 
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NAME D267-AF10 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 295 

1 AACATCCTTT CCTTCTTCCA AAAATGGAGC TTCAATCTTC TCCTTTCAAT TTAATTTCTT 
61 TGTTCCTCTT CTTTTCTTTT CTTTTTATTC TAGTGAAGAA ATGGAATGCC AAAATCCCAA 
121 AGTTACCTCC AGGTCCGTGG AGGCTTCCCT TTATTGGAAG CCTCCATCAC TTGAAGGGAA 
181 AACTTCCACA CCATAATCTT AGAGATCTAG CGCGAAAATA TGGACCTCTC ATGTACTTAC 
241 AACTCGGAGA AATTCCTGTA GTTGTAATAT CTTCGCCACG TGTAGCAAAA GCTGTACTAA 
301 AAACTCATGA TCTCGCTTTT GCAACTAGAC CACGATTCAT GTCCTCAGAC ATTGTGTTTT 
361 ACAAAAGCAG GGACATCTCT TTTGCCCCAT TTGGTGATTA CTGGAGACAG ATGCGTAAAA 
421 TATTGACTCA GGAACTCCTG AGCAACAAGA TGCTCAAGTC ATATAGCTTA ATCCGAAAGG 
481 ATGAGCTCTC GAAGCTCCTC TCATCGATTC GTTTGGAAAC AGGTTCTGCA GTGAACATAA 
541 ATGAAAAGCT TCTCTGGTTT ACGAGCTGCA TGACCTGTAG ATTAGCCTTT GGAAAAATAT 
601 GCAATGATCG GGATGAGTTG ATCATGCTAA TTAGGGAGAT ATTAACATTA TCAGGAGGAT 
661 TTGATGTGGG TGATTTGTTC CCTTCCTGGA AATTACTTCA TAATATGAGC AACATGAAAG 
721 CTAGGTTGAC GAATGTACAC CACAAGTATG ATTTAGTTAT GGAGAACATC ATCAATGAGC 
781 AC CAAGAGAA TCATGCAGCA GGGATAAAGG GTAACAACGA GTTTGGTGGC GAAGATATGA 
841 TCGATGCTCT ACTGAGGGCT AAGGAGAATA ATGAGCTTCA ATTTCCTATC GAAAATGACA 
901 ACATGAAAGC AGTAATTCTG GACTTGTTTA TTGCTGGAAC TGAAACTTCA TATACTGCAA 
961 TTATATGGGC ACTAT CAGAA TTGATGAAGC ACCCAAGTGT GATGGCCAAG GCACAAGCTG 
1021 AAGTGAGAAA AGTCTTCAAA GAAAATGAAA ATTTCGACGA AAATGATCTT GACAAGTTGC 
1081 CATAC CTAAA ATCAGTGATT AAAGAAACAC TAAGGATGCA CCCTCCAGTT CCTTTGTTAG 
1141 GGCCTAGAGA ATGCAGGGAC CAAACAGAGA TCGATGGCTA CACTGTACCT ATTAAAGCTA 
1201 GAGTTATGGT TAATGCTTGG GCGATAGGAA GAGATCCTGA AAGTTGGGAA GATCCTGAAA 
1261 GTTTCAAACC GGAGCGATTT GAAAATACTT CTGTTGATCT TACAGGAAAT CACTATCAGT 
1321 TCATTCCTTT CGGTTCAGGA AGAAGAATGT GTCCAGGAAT GTCGTTTGGT TTAGTTAACA 
1381 CAGGGCATCC TTTAGCCCAG TTGCTCTATT GCTTTGACTG GAAACTCCCT GACAAGGTTA 
1441 ATGCAAATGA TTTTCGCACT ACTGAAACAA GTAGAGTTTT TGCAGCAAGC AAAGATGACC 
1501 TCTACTTGAT TCCCACAAAT CACAGGGAGC AAGAATAGCT TAATTTAATG GAGTTCTTGG 
1561 AAGAATTAAA GAAGAAGGGC TATATAGGTG AGATTTTTTG TATGGTTGCA AGGTTTTTAG 
1621 TTCATACAAT AAGACAATAC ATTATATTCC AGTATTGTGT ATCATGTATA ATAAGGTTCC 
1681 TTTTGTTTAA AAAA 



1 MELQSSPFNL ISLFLFFSFL FILVKKWNAK IPKLPPGPWR LPFIGSLHHL KGKLPHHNLR 
61 DLARKYGPLM YLQLGEIPW VISSPRVAKA VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 
121 APFGDYWRQM RKILTQELLS NKMLKSYSLI RKDELSKLLS SIRLETGSAV NINEKLLWFT 
181 SCMTCRLAFG KICNDRDELI MLIREILTLS GGFDVGDLFP SWKLLHNMSN MKARLTNVHH 
241 KYDLVMENII NEHQENHAAG IKGNNEFGGE DMIDALLRAK ENNELQFPIE NDNMKAVILD 
301 LFIAGTETSY TAIIWALSEL MKHPSVMAKA QAEVRKVFKE NENFDENDLD KLPYLKSVIK 
361 ETLRMHPPVP LLGPRECRDQ TEIDGYTVPI KARVMVNAWA IGRDPESWED PESFKPERFE 
421 NTSVDLTGNH YQFIPFGSGR RMCPGMSFGL WTGHPLAQL LYCFDWKLPD KVNANDFRTT 
481 ETSRVFAASK DDLYLIPTNH REQE 
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NAME D284-AH5 
ORGANISM N I COT I ANA TABACUM 
SEQ. ID. NO. 297 

1 CAATCAGTGG ATGCGGGAGT AATATATAAT ATGCAAGTTG TAGAAAGAGA AAAAAAAAAT 
61 CAAGTAGCTA TTCTATACTG GGGCACAAAT AGTGAGTGAA AATGGAGACT GTTCAAATCA 
121 TAATAACAGC ATCTTGTGCT GCCATAATAA TTACTCTAGT GGTGTGTATT TGGAGAGTAC 
181 TGAATTGGGT TTGGTTCAGA CCAAAGAAGC TGGAAAAACT ATTGAGGAAA CAAGGTCTCA 
241 AAGGCAACTC CTACAAGATT TTGTATGGGG ATATGAAGGA GCTTTCTGGT ATGATTAAGG 
301 AAGCTAATTC CAAACCCATG AATCTTTCTG ATGATATTGC ACCAAGATTG GTGCCTTTCT 
361 TTCTTGACAC CATCAAGAAA TATGGTAAAA AATCCTTTGT ATGGTTAGGT CCGAAACCAC 
421 TGGTTCTTAT CATGGACCCT GAGCTTATAA AGGAAATATT TTCCAAATAC TATCTGTATC 
481 AAAAGCCTCA TGGAAATCCA GTTACCAAGC TATTAGTACA AGGACTAGTA AGCCTAGAGG 
541 AAGACAAATG GGCCAAACAT AGAAAAATCA TCAATCCAGC TTTCCATCTA GAGAAGCTAA 
601 AGCATATGCT TCCAGCTTTT TGCTTGAGCT GCACTGAGAT GCTGTGCAAA TGGGAAGATA 
661 TTGTTTCAAT TAAGGGCTCA CATGAGATAG ATGTATGGCC TCACCTTGAA CAATTAAGTA 
721 GCGATGTGAT CTCTCGGACA GCTTTTGGCA GTAACTTTGA AGAAGGTAAA AGGATATTTG 
781 AACTTCAGAA GGAACAAGCT CAGTATTTTG TAGAAGCTAT ACGCTCGGTT TATATACCAG 
841 GCTGGAGGTT TTTGCCAACA AAGAGGAACA GAAGAATGAA GGAAGTTGAA AAGGATGTTC 
901 GGGCCTCGAT AAGAGGCATT ATTGATAAAA GAGTGAAGGC AATGAAAGCA GGAGAGGCGA 
961 GTAATGAGGA TCTACTTGGT ATATTGTTGG AATCTAATTT TACAGAAGCT GAACAGCATA 
1021 GACACAAGGA TTCTGCGATG AGCATTGAAG AAGTCATTCA AGAATGCAAG TTATTCTATG 
1081 TTGCTGGCCA AGAAACTACA TCAGTGTTGC TTGTGTGGAC TCTAATATTG TTGAGTAGGC 
1141 ATCAAGATTG GCAGAGCCGA GCCAGAGAAG AGGTGTTTCA AGTCTTTGGT AATCAGAAAC 
1201 CAGATTTTGA CGGATTGAAT CGTCTAAAAG TTGTGACAAT GATCTTGTAT GAGTCTTTAA 
1261 GGCTATACTC CCCAGTAGTG TCACTAATCC GGCGGCCTAA TGAGGATGCT ATATTAGGAA 
1321 ATGTATCTCT GCCAGAAGGT GTGCTACTCT CATTACCAGT GATCTTATTA CACCACGATG 
1381 AAGAGATATG GGGTAAAGAT GCAAAGAAGT TCAATCCAGA AAGATTTAGA GATGGAGTCT 
1441 CAAGTGCAAC AAAGGGTCAA GTCACTTTTT TTCCATTTAC TTGGGGTCCC AGAATATGCA 
1501 TCGGACAAAA TTTTGCCATG TTAGAAGCAA AGACTGCTTT GGCTATGATC CTACAACGCT 
1561 TCTCATTCGA ACTGTCTCCA TCTTATGCAC ATGCTCCTCA GTCCATATTA ACTATGCAAC 
1621 CCCAACATGG TGCTCCACTA ATTCTGCACA AAATATAGTT TGTTACTTTA AGCAGTGTCT 
1681 TGTTATATGT CAGAGAGTCC AAAATGTTTA ATTAAGGCTT GTAGAACTGC CAAATGGAAC 
1741 TTCATTTGCA TTCGTGGGTT GTAGATTGTT GTAATTGGAC AAGTATACTG TTTATTTTAG 
1801 AGTTTTAAGA AAAAAAAAA 

SEQ. ID. NO. 298 

1 METVQIIITA SCAAIIITLV VCIWRVLNWV WFRPKKLEKL LRKQGLKGNS YKILYGDMKE 
61 LSGMIKEANS KPMNLSDDIA PRLVPFFLDT IKKYGKKSFV WLGPKPLVLI MDPELIKEIF 
121 SKYYLYQKPH GNPVTKLLVQ GLVSLEEDKW AKHRKIINPA FHLEKLKHML PAFCLSCTEM 
181 LCKWEDIVSI KGSHEIDVWP HLEQLSSDVI SRTAFGSNFE EGKRIFELQK EQAQYFVEAI 
241 RSVYIPGWRF LPTKRNRRMK EVEKDVRASI RGIIDKRVKA MKAGEASNED LLGILLESNF 
301 TEAEQHRHKD SAMSIEEVIQ ECKLFYVAGQ ETTSVLLVWT LILLSRHQDW QSRAREEVFQ 
361 VFGNQKPDFD GLNRLKWTM ILYESLRLYS PWSLIRRPN EDAILGNVSL PEGVLLSLPV 
421 ILLHHDEEIW GKDAKKFNPE RFRDGVSSAT KGQVTFFPFT WGPRICIGQN FAMLEAKTAL 
481 AMILQRFSFE LSPSYAHAPQ SILTMQPQHG APLILHKI 



Figure 150: Amino Acid Identity of Group Members 



Group 1 

AQIAINLWSMLGHLLHHFTWAPAPGVNPEa>IDLEESPGTWYMKNPIQAIPTPRLPAHLYGRVPVDM 

I 

AQ LAI N LVT SMLGH LL HH FTWAP P P GVN PENIDLEESP GTVT YMKN P I QAI PT P RL PAH L YGRVP VDM 



SEQ ID No. :2 D58-BG7 
(98.5) 

SEQ ID No. :4 D58-AB1 



Group 2 

QLAINLVTSMLGHLFIILHGLRPRGLTRRILTWRRALEQ 



SEQ ID No. :8 D58-BE4 



Group 3 

EGLAVRMVALSLGC 1 1 QC FDWQRI GEELVDMT EGTGLT LP KAQP LVAKCS P RPKMANLLSQI 

I lit 
EGLAIRMVALSLGCIIQCFDWQRLGEGLVDKTEGTGLTLPKAQPLVAKCSPRPIMANLLSQI 



SEQ ID No.: 10 D56-AH7 
(93.5) 

SEQ ID No. i 12 Dl3a-5 



Group 4 

I GFAT LVTHLT FGRLLQGFD FSKP SNTP I DMTEGVGVT LP KVNQVEVLI TPRLPS KL YLF 

I I II 

INFATLVTHLT FGRLLQG FDFS TP SNTP I DMTEGVGVTLP KVNQVEVLI S PRLPS KLYVF 



SEQ ID No.:14 D56-AG10 
(93.3) 

SEQ ID No. :18 D34-62 



Group 5 

IILALPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVLKPRSF 

I 

IILALPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVMKPRSF 

I I 

IILALPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVLKPRSC 



SEQ ID No. :20 D56-AA7 
(98.2) 

SEQ ID No.: 144 D185-BD3 
(96.4) 

SEQ ID No. :22 D56-AE1 



Group 6 

IALGVASMELALSNLLYAFDWELPFGMKKEDIDTNARPGITMHKKNELYLIPKNYL 

II I I 

IALGVASMELALSNLLYAFDWELPYGVKKENIDTNVRPGITMHKKNELCLIPRNYL 

I I 

IALGVASMELALSNLLYAFDWELPYGVKKEDIDTNVRPGIAMHKKNELCLVPKNYL 

I I I 

IALGVASMELALSNLLYAFDWELPYGVKKEDIDTNVRPGIAMHKKNELCLVPKKLFINYIGTWISC 



SEQ ID No. :24 D35-BB7 
(92.8) 

SEQ ID No.:26 D177-BA7 
(96.4) 

SEQ ID No.:28 D56A-AB6 
(94.6) 

SEQ ID No.: 30 D144-AE2 



Group 7 

ISFGLANAYLPLAQLLYHFDWELPTGIKPSDLDLTELVGVTAARKSDLYLVATPYQPPQN 
III I 
ISFGLANAYLPLAQLLYHFDWKLPAGIEPSDLDLTELVGVTAARKSDLYLVATPYQPPQK 



SEQ ID No.: 32 D56-AG11 
(93.3) 

SEQ ID No. :34 D179-AA1 



Group 8 

MLFGLANVGQPLAQLLYHFDWKLPNGQSHENFDMTESPGISATRKDDLVLIATPYDSY 

II I II 

MLFGLANVGQPLAQLLYHFDWKLPNGQTHQNFDMTESPGISATRKDDLILIATPAHS 



SEQ ID No. :36 D56-AC7 
(91.2) 

SEQ ID No.:38 D144-AD1 



Group 9 

LLFGLVNVGHPLAQLLYHFDWKTLPGISSDSFDMTETDGVTAGRKDDLCLIATPFGLN 



SEQ ID No.:40 D144-AB5 



Group 10 

MS FGLVNTGHPLAQLLYFFDWKFPHKVNAADFHTTETSRVFAAS KDDLYLI PTNHMEQE 

I II II I 

MS FGLVNTGHPLAQLLYCFDWKLPDKVNANDFRTTETSRVFAAS KDDLYLI PTNHREQE 



SEQ ID No.:42 D181-AB5 
(89.8) 

SEQ ID No. :44 D73-Ac9 



Group 11 

MQFGLALVTLPLAHLLHNFDWKLPEGINARDLDMTEANGISARREKDLYLIATPYVSPLD 



SEQ ID No. :46 D56-AC12 



Group 12 

MTYALQVEHLTMAHLIQGFNYRTPTDEPLDMKEGAGITIRKVNPVKVIITPRIiAPELY 

III II I 

MTYALQVEHLTMAHLIQGFNYKTPNDEALDMKEGAGITIRKVNPVELIIAPRLAPELY 

I 

MTYALQVEHLTMAHLIQGFNYKTPNDEALDMKEGAGITIRKVNPVELIITPRLAPELY 

I I I 

MTYALQVEHLTMAHLIQGFNYRTPNDEPLDMKEGAGITIRKVNPVELIIAPRLAPELY 

I 

MTYALQVEHLTMAHLIQGFNYRTPNDEPLDMKEGAGITIRKVNPVELIIAP-LAPELY 

I 

MTYALQVEHLTMAHLIQGFNYRTPNDEPLDMKEGAGITIRKVNPAELIIAPRLAPELY 
II II Mill 

MTYALQVEHLTIAHLIQGFNYKTPNDEPLDMKEGAGLTIRKVNPVEVTTTARLAPELY 

I 

MT YALQVEH LT I AHLI QG FN YKT PNDEP LDMKEGAGLT I RKVNPVEVTITARLAP EL Y 



SEQ ID No. :48 D58-AB9 
(89.6) 

SEQ ID No. :50 D56-AG9 
(98.2) 

SEQ ID No. :52 D56-AG6 
(94.8) 

SEQ ID No.: 54 D35-BG11 
(98.3) 

SEQ ID No. :56 D35-42 
(98.3) 

SEQ ID No. :58 D35-BA3 
(84.5) 

SEQ ID No. :60 D34-57 
(98.3) 

SEQ ID No. :62 D34-52 



Group 13 

YSLGLKVIRVTLANMLHGFNWKLPEGMKPEDISVEEHYGLTTHPKFPVPVILESRLSSDLYSPIT 



SEQ ID No. :66 D56-AD10 



Group 14 

YSLGIRIIRATIANLLHGFNWRLPNGMSPEDISMEEIYGLITHPKVALDVMMEPRLPNHLYK 



SEQ ID No. :68 D56-AA11 



Group 15 

INFSIPLVELALANLLFHYNWSLPEGMLAKDVDMEEl/UjGITMHKKSPLCLVASHYTC 

I II 

INFS I PLVELALANLLFHYNWSLPEGMLPKDVDMEEALGITMHKKSPLCLVASHYNLL 



SEQ ID No. :70 D177-BD5 
(94.7) 

SEQ ID No. :84 D177-BD7 



Group 16 

MQLGLYALEMAVAHLLLCFTWELPDGMKPSELKMDDIFGLTAPRANRLVAVPSPRLLCPLY 

I I 
MQ LG L YALEMAVAH LLHC FTWE L P DGMK PSELKMDDI FGLTAPRAN RLVAVP TPRLLCPLY 

I 

MQLGLYALEMAVAHLLHCFTWELPDGMKPSELKMDDIFGLTAPKANRLVAVPTPRLLCPLY 



SEQ ID No. :74 D58-BC5 
(96.7) 

SEQ ID No.:76 D58-AD12 
(98.4) 

SEQ ID No.:72 D56A-AG10 



Group 17 

MLWSAS I VRVS YLT CI YRFQVYAGS VFRVA 



SEQ ID No. :78 D56-AC11 



I 

MLWSAS IVRVS YLTCI YRFQVYAGSVSRVA 



(96.7) 

SEQ ID No.: 88 D56-AD6F 



Group 18 

LNFAMLEAKMALALI LQHYAFELS PS YAHAPHTI ITLQPQHGAPLI LRKL 



SEQ ID No.:90 D73A-AD6 



Group 19 

QNFAILEAKMAIAMI LQRFS FELS PS YTHSPY TWTLKPKYGAPLIMHRL 

I It I I I I I I I I I II 

QNFAMLEAKMALSMI LQRFS FELS PS YAHAPQSILTVQPQYGAPLI FHKL 
I I II I lit I 

INFAMTEAKMAMAMI LQRFS FELS P S YTHAPQSVITMQPQ YGAP LI LHKL 
I 

INFAMAEAKMAMAMILQRFSFELSPSYTHAPQSVITMQPQYGAPLILHKL 

i i i ii mi ii mi 

QN FAMMEAKMAVAMI LHKFS FELS P SYTHAPFAIVTIHPQ YGAP LLMRRL 
I 

QNFAMMEAKMAVAMILQKFSFELSPSYTHAPFAIVTIHPQYGAPLLMRRL 



Group 20 

QNFAMLEAKMAMAMILKTYAFELSPSYAHAPHPLLLQPQYGAQLILYKL 



SEQ ID No. 


:96 


D70A-AB5 










(72.0) 


SEQ 


ID 


No. : 


100 


D70A-AB8 










(82.0) 


SEQ 


ID 


No. : 


102 


D70A-BH2 










(98.0) 


SEQ 


ID 


No. : 


104 


D70A-AA4 










(70.0) 


SEQ 


ID 


No. : 


108 


D70A-BA9 










(98.0) 


SEQ 


ID 


No. : 


106 


D70A-BA1 


SEQ 


ID 


No. : 


110 


D70A-BD4 



Group 21 

YSMGLKAIQASLANLLHGFNWSLPDNMTPEDLNMDEIFGLSTPKKFPLATVIEPRLSPKLYSV 
I I 

YSLGLKEIQASLANLLHGFNWSLPDNMTPEDLNMDEIFGLSTPKKFPLATVIEPRLSPKLYSV 
I I 

HSLGLKVIQASLANLLHGFNWSLPDNMTPEDLNMDEIFGLSTPKKFPLATVIEPRLSPKLYSV 



SEQ ID No.: 112 D181-AC5 
(96.8) 

SEQ ID No.: 114 D144-AH1 
(96.8) 

SEQ ID No. :116 D34-65 



Group 22 

LCFPCLISSYILALNVNLYHNFLQISPSISY 



SEQ ID No.: 118 D35-BG2 



Group 23 

S GLAQC WGLALAT LVQC FEWKRVS EEWDLTEGKGLTMP KP EP LMARCEARDI FHKVLS E I S 



SEQ ID No.:120 D73A-AH7 



Group 24 

LGLATVHWLMLARMIQEFEWSAYPENRKVDLLRNWNLLW 

I I I I I I I I I 

LGLATVHVNLMLARMI QEFEWSAYP ENRKVD FTEKLEFTWMKN PLRAKVKP RMQ W 
I 

LGLATVHVNLMLARTI QE FEW SAYPENRKVD FTEKLEFTWMKN PLRAKVKP RMQ W 



SEQ ID No.: 136 D185-BG2 
(77.5) 

SEQ ID No.: 122 D58-AA1 
(98.2) 

SEQ ID No.: 134 D185-BC1 



Group 25 

YALAMLHLEYFVANLVWHFRWEAVEGDDVDLSEKLEFTWMKNPLRARICPRVNSI 



SEQ ID No.: 124 D73A-AE10 



Group 26 

QQVGLLRTTI FI ASLLS E YKLKPRSHQKQVELTDLNPASWLHS I KGELLVDAI PRKKAAF SEQ ID No.: 126 D56A-AC12 
Group 27 

ITFAKFVNELALARLMFHFDFSLPKGVKHEDLDVEEAAGITVRRKFPLLAVATPCS SEQ ID No.: 128 D177-BF7 

I (98.2) 

ITFAKFWEIALARLMFHFDFSLPKGVKHADLDVEEAAGITVRRKFPLLAVATPCS SEQ ID No.: 140 D185-BD2 

Group 28 

QRYAINHLMLFIALFTALIDFKRHKTDGCDDIAYIPTIAPKDDCKVFLSQRCTRFPSFS SEQ ID No.: 130 D73A-AG3 

Group 29 

MSFGLANLYLPLAQLLYHFDWKLPTGIKPRDLDLTELSGITIARKGDLYLNATPYQPSRE SEQ ID No.: 132 D70A-AA12 

I III I I I I I I I I (80.0) 

ISFGIANVYLPLAQLLYHFDWKLPTGINSSDLDMTESSGVTCARKSDLYLTATPYQLSQE SEQ ID No.: 86 176-BF2 

Group 30 SEQ ID No.:146 D176-BC3 
QNFAMLEAKTTLAMILQRFSFELSPSYAHAPQSIITCNPSM\fijHLFC^ 

Group 31 

ADMGLRAVS LALGALI QC FDWQI EEAES LEES YNS RMTMQNKPLKWCTPREDLGQLLSQL SEQ ID No.: 148 D176-BB3 

Group 32 

MNYSLQVEHLSIAHMIQGFSFATTTNEPLDMKQGVGLTLPKKTDVEVLITPRLPPTLYQY SEQ ID No . : 6 D186-AH4 



The percentage identity between most related pairs is noted in (0.0%). Each group had at 
least 70% identity to another group member. Group 19 contained the lowest percentage 
identity at 70.0%. 



. FIGURE 151: COMPARISON OF SEQUENCE GROUPS 



ALIGNMENT OF GROUP 1 

D58-BG7 GCACAACTTGCTATCAACTTGGTCACATCTATGTTGGGTCATTTGTTGCATCATTTTACA SEQ ID No 1 

I 

D58-AB1 GCACAACTTGCTATCAACTTGGTCACATCTATGTTGGGTCATTTGTTGCATCATTTTACG SEQ ID No 3 

I 

D5B-BE4 GCACAACTTGCTATCAACTTGGTCACATCTATGTTGGGTCATTTGTT-CATCATTTTACA SEQ ID No 7 

***************************************** ****** ************ 

D58 -BG7 TGGGCTCCGGCCCCGGGGGTTAACCCGGAGGATATTGACTTGGAGGAGAGCCCTGGAACA 

I I 
D58-AB1 TGGGCTCCGCCCCCGGGGGTTAACCCGGAGAATATTGACTTGGAGGAGAGCCCTGGAACA 

I I 
D5 8 -BE4 TGGGCTCCGGCCCCGGGGGTTAACCCGGAGGATATTGACTTGGAGGAGAGCCCTGGAACA 
********* ******************** ***************************** 

D5 8 -BG7 GTAACTTACATGAAAAATCCAATACAAGCTATTCCAACTCCAAGATTGCCTGCACACTTG 

I 

D5 8 -AB 1 GTAACTTACATGAAAAATCCAATACAAGCTATTCCTACTCCAAGATTGCCTGCACACTTG] 

I I I I I I I I I I I M I I I t I I I I I I I t I I I I I | I | I I I | t I I 1 I I I I M 
D58-BE4 GTAACTTACATGA 

D5 8 -BG7 TATGGACGTGTGCCAGTGGATATGTAA 

D58-AB1 TATGGACGTGTGCCAGTGGATATGTAA 
I I I I I I I I I I I t I I I t I I t I I I I I I I I 
D5B-BE4 

PERCENT IDENTITY OF GROUP 1 

D58-BG7 D58-BE4 D58-AB1 

D58-BG7 + + * 96.2 98.1 SBQ ID NO 1 

D58-BE4 *** 94.0 SEQ JD NO 7 

D58-AB1 *** SEQ ID No 3 

ALIGNMENT OF GROUP 2 

D56-AH7 GAAGGATTGGCTGTTCGAATGGTTGCCTTGTCATTGGGATGTATTATTCAATGTTTTGAT SEQ ID No 9 

1 I ( 

D13a-5 GAAGGATTGGCTATTCGAATGGTTGCATTGTCATTGGGATGTATTATTCAATGCTTTGAT SEQ ID No 11 

D56-AH7 TGGCAACGAATCGGCGAAGAATTGGTTGATATGACTGAAGGAACTGGACTTACTTTGCCT 

III I I 
Dl 3 a- 5 TGGCAACGACTTGGGGAAGGATTGGTTGATAAGACTGAAGGAACTGGACTTACTTTGCCT 
********* * ** **** *********** **************************** 

D56-AH7 AAAGCTCAACCTTTGGTGGCCAAGTGTAGCCCACGACCTAAAATGGCTAATCTTCTCTCT 

I I I 

Dl 3 a- 5 AAAGCTCAACCTTTAGTGGCCAAGTGTAGCCCACGACCTATAATGGCTAATCTTCTTTCT 
************** ************************* *************** *** 

D56-AH7 CAGATTTGA 

D13a-5 CAGATTTGA 

PERCENT IDENTITY OF GROUP 2 

D56-AH7 D13a-5 

D56-AH7 *** 93.7 SEQ ID No 9 

D13a-5 *** SEQ ID No 11 



FIGURE 151: COMPARISON OF SEQUENCE GROUPS 



ALIGNMENT OF GROUP 3 

D56-AG10 ATAGGTTTTGCGACTTTAGTGACACATCTGACTTTTGGTCGCTTGCTTCAAGGTTTTGAT SEQ ID No 13 

I 

D35-33 ATAGGCTTTGCGACTTTAGTGACACATCTGACTTTTGGTCGCTTGCTTCAAGGTTTTGAT SEQ ID No 15 

I I I 

D34-62 ATAAATTTTGCGACTTTAGTGACACATCTGACTTTTGGTCGCTTGCTTCAAGGTTTTGAT SEQ ID No 17 

D56-AG1 0 TTTAGTAAGCCATCAAACACGCCAATTGACATGACAGAAGGCGTAGGCGTTACTTTGCCT 

i I 

D34 -62 TTTAGTACGCCATCAAACACGCCAATAGACATGACAGAAGGCGTAGGCGTTACTTTGCCT 
******* ****************** ********************************* 

D56-AG1 0 AAGGTTAATCAAGTTGAAGTTCTAATTACCCCTCGTTTACCTTCTAAGCTTTATTTATTTTGA 

I I 

D3 5 - 3 3 AAGGTTAATCAAGTTGAAGTTCTAATTACCCCTCGTTTACCTTCTAAGCTTTAT TTAT 

II I t It 

D34-62 AAGGTAAATCAAGTGGAAGTTCTAATTAGCCCTCGTTTACCTTCTAAGCTTTATGTATTCTGA 
***** ******** ************* ************************* *** 

PERCENT IDENTITY OF GROUP 3 

D56-AG10 D35-33 D34-62 



D56-AG10 *** 98.9 95.1 SEQ ID No 13 

D35-33 *** 94.4 SEQ ID No 15 

D34-62 *** SEQ ID No 17 

ALIGNMENT OF GROUP 4 

D56-AA7 ATTATAC TTGCATTGCCAATTCT TGGCATCAC TTTGGGACGTTTGGTTCAGAACTTTGAG 

I 

D56 -AE1 ATTATACTTGCATTGCCAATTCTTGGCATTACTTTGGGACGTTTGGTTCAGAACTTTGAG 

II til 
Dl 6 5-BD3 ATTATCCTTGCACTGCCAATTCTTGGCATTACCTTGGGACGCTTGGTGCAGAACTTTGAG 
***** ****** **************** ** ******** ***** ************ 

D56-AA7 CTGTTGCCTCCTCCAGGCCAGTCGAAGCTCGACACCACAGAGAAAGGTGGACAGTTCAGT 

D56-AE1 CTGTT GCCTCCTCCAGGCCAGTC GAAGCTCGACACC ACAGAGAAAGGTGGACAGTTCAGT 

I I I I I II 

D185-BD3 

D5 6 -AA7 CTCCACATTTTGAAGCATTCCACCATTGTGTTGAAACCAAGGTCTTTCTGA 
I 

D5 6 -AE1 CTCCATATTTTGAAGCATTCCACCATTGTGTTGAAACXAAGGTCTTGCTGA 

II I I II I 

Dl 8 5-BD3 CTGCACATTTTGAAGCATTCCACCATTGTGATGAAACCAAGATCTTTTTAA 
** ** ************************* ********** **** * * 

PERCENT IDENTITY OF GROUP 4 

D56AA7 D56-AE1 D185-BD3 



D56AA7 +** 98.2 87.7 SEQ ID No 19 

D56-AE1 *** 87.1 SEQ ID No 21 

D185-BD3 *** SEQ ID No 143 



FIGURE 151: COMPARISON OF SEQUENCE GROUPS 

ALIGNMENT OF GROUP 5 



D56A-AB6 ATTGCACTTGGGGTTGCATCCATGGAACTTGCTTTGTCAAATCTTCTTTATGCATTTGAT SEQ ID No 27 

I I 

D35-BB7 ATTGCACTTGGGGTTGCATCAATGGAACTTGCATTGTCAAATCTTCTTTATGCATTTGAT SEQ ID No 23 

( I 

D177-BA7 ATTGCACTTGGGGTTGCATCCATGGAACTTGCTTTGTCAAATCTTCTTTATGCATTTGAT SEQ ID No 25 

D14 4-AE2 ATTGCAC TTGGGGTT GC ATCC ATGGAACTTGCTTTGTCAAATCTTCTT TATGCATTTGAT SEQ ID No 29 

D56A-AB6 TGGGAGTTGCCTTATGGAGTGAAAAAAGAAGACATCGACACAAACGTTAGGCCTGGAATT 

III I II 

D35-BB7 T GGGAGT T ACC TT TT GG AATG AAAAAAGAAGACATTGAC ACAAACGCC AGGCCT GGAATT 

III I II 

D177-BA7 TGGGAGTTACCTTACGGAGTGAAAAAAGAAAACATTGACACAAATGTCAGGCCTGGAATT 

II III 
D14 4-AE2 TGGGAGTTGCCTTATGGAGTGAAAAAAGAAGACATCGACACAAACGTTAGGCCTGGAATT 



D56A-AB6 
D35-BB7 
D177-BA7 
D144-AE2 



GCCATGCACAAGAAAAACGAACTTTGCCTTGTCCCAAAAAA-TTATTTATAA 

It II I I II 

ACCAT GCATAAGAAAAACGAACTTTATCTTATCCCTAAAAA- TT ATCT ATAG 

I I I 

ACCATGCATAAGAAAAACGAACTTTGCCTTATCCCTAGAAA-TTATCTATAG 

I I I I I I I I I I It I 



********** 



**** * *** **** 



D56A-AB6 



D177-BA7 



I I I f I I I I I I I I I I I ! I I I I I I I I 
ATTGGGACGTGGATCTCATGCTAG 



D14 4-AE2 
I 

PERCENT IDENTITY OF GROUP 5 
1 



D56A-AB6 



D56A-AB6 
D35-BB7 
D144-AE2 
D177-BA7 



90.6 
*** 



97.1 
87.7 



91.8 
93.0 
88.9 



SEQ ID No 27 
SEQ ID No 23 
SEQ ID No 29 
SEQ ID No 25 



ALIGNMENT OF GROUP 6 



D56-AG11 
D179-AA1 



ATTTCGTTTGGTTTAGCTAATGCTTATTTGCCATTGGCTCAATTACTTTATCACTTTGAT 

I I I 

ATTTCGTTTGGCTTAGCTAATGCTTATTTGCCATTGGCTCAATTACTATATCACTTCGAT 
*********** *********************************** ******** *** 



D56-AG1 1 
D179-AA1 



TGGGAAC TCCCCACT GGAATCAAACCAAGCGACT TGGACTTGACTGAGTTGGTTGGAGTA 

I II I 

TGGAAACTCCCTGCTGGAATCGAACCAAGCGACTTGGACTTGACTGAGTTGGTTGGAGTA 
*** ******* ******** ************************************** 



D56-AG11 
D17 9-AA1 



ACTGCCGCTAGAAAAAGTGACCTTTACTTGGTTGCGACTCCTTATCAACCTCCTCAAAACTGA 

I 

ACTGCCGCTAGAAAAAGTGACCTTTACTTGGTTGCGACTCCTTATCAACCTCCTCAAAAGTGA 
*********************************************************** *** 



FIGURE 151: COMPARISON OP SEQUENCE GROUPS 



PERCENT IDENTITY OF GROUP 6 

SEQ ID No 31 SEQ ID No 33 

D56-AG11 D17 9-AA1 

D56-AG11 *** 95.6 SEQ ID No 31 

D179-AA1 *** SEQ ID No 33 

ALIGNMENT OF GROUP 7 

D56-AC7 ATGCTATTTGGTTTAGCTAATGTTGGACAACCTTTAGCTCAGTTACTTTATCACTTCGAT SEQ ID No 35 

D14 4-AD1 ATGCTATTTGGTTTAGCTAATGTTGGACAACCTTTAGCTCAGTTACTTTATCACTTCGAT SEQ ID No 37 

D56-AC7 TGGAAACTCCCTAATGGACAAAGTCATGAGAATTTCGACATGACTGAGTCACCTGGAATT 

I III 

D14 4-AD1 TGGAAACTCCCTAATGGACAAACTCACCAAAATTTCGACATGACTGAGTCACCTGGAATT 
********************** *** * ****************************** 

D56-AC7 TCTGCTACAAGAAAGGATGATCTTGTTTTGATTGCCACTCCTTATGATTCTTATTAA 



1 n i ( 



Dl 4 4 -AD 1 TCTGCTACAAGAAAGGATGATCTTATTTTGATTGCCACTCCTGCTCATTCTTGA 

************************ ***************** * ****** 
PERCENT IDENTITY OF GROUP 7 

D144-AD1 D56-AC7 
D144-AD1 *** ~ 94.3 SEQ ID No 37 

D56-AC7F *** SEQ ID No 35 



I D«l«t«d: ^ 



ALIGNMENT OF GROUP 9 

D1B1-AB5 ATGTCGTTTGGTTTAGTTAACACTGGGCATCCTTTAGCTCAGTTGCTCTATTTCTTTGAC SEQ ID No 41 

I I I 

D73-AC9 ATGTCGTTTGGTTTAGTTAACACAGGGCATCCTTTAGCCCAGTTGCTCTATTGCTTTGAC SEQ ID No 43 

*********************** ************** ************* ******* 

D18 1-AB5 TGGAAATTCCCTCATAAGGTTAATGCAGCTGATTTTCACACTACTGAAACAAGTAGAGTT 

ill II I 

D 7 3 -AC 9 T GGAAACTCCC TGACAAGGTTAATGCAAATGATTTTCGCACTACTGAAACAAGTAGAGTT 

****** ***** * ************ ******** ********************** 

D18 1-AB5 TTTGCAGCAAGCAAAGATGACCTCTACTTGATTCCAACAAATCACATGGAGCAAGAGTAG 

I I I 

D 7 3 -AC 9 TTTGCAGCAAGCAAAGATGACCTCTAC TTGATTCCC ACAAATCACAGGGAGCAAGAATAG 

*********************************** ********** ********* *** 

PERCENT IDENTITY OF GROUP 9 

D181-ABS D73-AC9 
LU81-aB5 *** 92.8 seq id no 41 

D73-AC9 *** SEQ ID NO 43 
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ALIGNMENT OF GROUP 11 



D58- 


-AB9 


ATG AC TT ATGC AT TGC AAGTG GAAC AC C T AAC AATG GC ACAT TT GATCC AGGGTTT CAAT 








47 


D56- 


-AG 9 


1 

ATGAC TT ATGC AT T GCAAGT G GAAC ACCTAAC AATG GC ACATTT AATCCAGGGT TTCAAT 










D35-BGU 


1 1 

A 1 UnL 1 In l<JLnl 1 Ijv*/\M\j 1 u<JrV/v^rVC 1 1 MnLKHi vjLjOU^H ill U-M 1 \A##vUlV) 111 UAA 1 










D34-25 


i II I 

A T fl AP TT A Tf^P AT T A P A A <VT(Z A AP APPT A AP & A T A CP ftp AT TT flh TPP AfVIflTTTr A AT 










D35- 


-BA3 


I II I 

AT G AC TT ATGC AT TGC AAGTG GAAC AC TT AAC AATGGC AC AT TT GATCC AAGGT TTCAAT 


SEQ 


ID 


No 


57 


D34-52 


1 II 1 
ATGAC TTATGCATTACAAGTGGAACACCT AAC AATAGC ACATTT GATCCAGGGTTTCAAT 


SEQ 


ID 


No 


61 


D56- 


-AG6 


1 i 1 
ATG ACTT AT GC AT TGC AAGTGGAAC AC CT AAC AATG GC ACATTT AATC C AGG GT TTCAAT 


SEQ 


ID 


No 


51 


D35-42 


1 1 1 










D34- 


-57 


1 t 1 1 

ATGACTTATGCAT TACAAGTGGAACACCTAACAATAGCACATTTGATCCAGGGT TTCAAT 










D58-AB9 












D56-AG9 


III 1 

TACAAAACTCCAAATGACGAGGCCTTGGATATGAAGGAAGGTGCAGGCATAACTATACGT 










D35- 


-BG11 


1 1 

TAPAflAAPTPPAA ATHAPfiAflPPPTTrtnATATnA A/^rZAArZ^TrV* A^i^T l ATAAr*TATAr , ^T 

l I M 

T AC AAAACTCC AAATGACGAGCCCC TG GAT AT GAAG GAAGGT GC AG GATT AACT AT ACGT 

1 1 II 
TACAGAACTCCAAATGACGAGCCCTTGGATATGAAGGAAGGTGCAGGCATAACTATACGT 










D34 


-25 










D35' 


-BA3 










D34- 


-52 


1 1 1 
TACAAAACTCCAAATGACGAGCCCTTGGATATGAAGGAAGGTGCAGGATTAACTATACGT 

1 1 1 
TACAAAACTCC AAATGACGAGGCCT TGGATAT GAAGGAAGGTGC AGGCATAACAATACGT 










D56- 


-AG6 










D35 


-42 


1 1 1 
TACAGAACTCCAAATGACGAGCCCTTGGATATGAAGGAAGGTGCAGGCATAACTATACGT 










D34 


-57 


1 III 
TAC AAAACTCC AAATGACGAGCCCTTGGATATGAAGGAAGGTGCAGGATTAACCATACGT 
**** ******** *** *** ** ********** *********** **** ****** 










D58 


-AB9 


AAGGTAAATO:TGTGAAAGTGATAATTACGCCTCGCTTGGCACCTGAGCTTTATTAA 










D56 


-AG9 


t 1 1 1 










D35 


-BG11 


AAGGTAAATCCTGTGGAACTGAT AATAGC GCCTCGCCTGGCACCTGAGCTTTATTAA 

1 II II 1 
AAAGTAAATCCTGTAGAAGTGACAATTACGGCTC GCCTGGCACC TGAGCTTTAT TAA 

1 II 1 1 i 
AAGGT AAATCC TGCGGAACTGAT AATAGCGCC TC GCCTGGCACC TGAGCTTT ATTAA 

1 III Mi 
AAAGTAAATCCTGTAGAAGTGACAATTACGGCTCGCCTGGCACCTGAGCTTT ATTAA 










D34 


-25 










D35 


-BA3 










D34- 


-52 










D56 


-AG6 


1 1 1 1 1 1 1 1 
AAGGT AAATCCAGTGGAATTGATAATAACGCCTCGCTTGGCACCTGAGCTTTACTAA 

1 i 1 1 1 1 1 1 
AAGGT AAATCC TGTGGAACTGATAATAGCGCCCC TGGCACCTGAGCTTT ATTAA 

1 1 1 1 1 1 1 1 1 1 1 
AAAGTAAATCCTGTAGAAGTGACAACTACGGCTCGCCTGGCACCTGAGCTTTATTAA 










D35 


-42 










D34 


-57 











** ******** * ** *** ** ** * * **************** *** 



FIGURE 151: COMPARISON OF SEQUENCE GROUPS 

PERCENT IDENTITY OF GROUP 11 

D58-AB9 D56-A36 D35-42 D34-57 D34-25 





D56-SG9 


D35-BGL1 




D35-BA3 


D34-52 


D58-AB9 *** 


93.8 93.2 94.3 


90.8 93.2 


90.9 


92.0 


91.5 


SEQ ID No 47 


D56-A39 


*** 96.6 97.2 


94.2 96.6 


91.5 


92.6 


92.0 


SEQ ID No 49 


D56-A36 


*** 93.8 


90.2 92.6 


90.3 


90.9 


90.3 


SEQ ID No 51 


D35-BGL1 




97.1 99.4 


90.9 


92.0 


91.5 


SEQ ID No 53 


D35-42 




*** 96.5 


87.3 


88.4 


87.9 


SEQ ID No 55 


D35-BA3 




*** 


90.3 


91.5 


90.9 


SEQ ID No 57 


D34-57 






*** 


98.9 


98.3 


SEQ ID No 59 


D34-52 








*+* 


99.4 


SEQ ID No 61 


D34-25 










*#* 


SEQ ID No 63 



ALIGNMENT OF GROUP 14 

D177-BD7 ATTAATTTTTCAATACCACTTGTTGAGCTTGCACTTGCTAATCTATTGTTTCATTATAAT SEQ ID No 83 

D17 7-BD5 ATTAATTTTTCAATACCACTTGTTGAGCTTGCACTTGCTAATCTATTGTTTCATTATAAT SEQ ID No 69 

************************************************************ 

D177-BD7 TGGTCACTTCCTGAGGGGATGCTACCTAAGGATGTTGATATGGAAGAAGCTTTGGGGATT 
t I 

D17 7 -BD5 TGGTCACTTCCTGAAGGGATGCTAGCTAAGGATGTTGATATGGAAGAAGCTTTGGGGATT 
************** ********* *********************************** 

D17 7-BD7 ACCATGCACAAGAAATCTCCCCTTTGCTTAGTAGCTTCTCATTATAACTTGTTGTGA 

I I If 

D17 7-BD5 ACCATGCACAAGAAATCTCCCCTTTGCTTAGTAGCTTCTCATTATA-CTTGTTGA — 

| **********************+****+****************** ******* 

PERCENT IDENTITY OF GROUP 14 

D177-BD7 D177-BD5 

D177-BD7 *** 96.0 SEQ ID No 83 

D177-BD5 *** SEQ ID No 69 



ALIGNMENT OF GROUP 15 

D56A-AG10 ATGCAACTTGGGCTTTATGCATTGGAAATGGCTGTGGCCCATCTTCTTCATTGTTTTACT SEQ ID No 71 



D58-AD12 ATGCAACTTGGGCTTTATGCATTGGAAATGGCTGTGGCCCATCTTCTTCATTGTTTTACT SEQ ID No 75 

I t II 

D58-BC5 ATGCAACTTGGGCTTTATGCATTAGAAATGGCAGTGGCCCATCTTCTTCTTTGCTTTACT SEQ ID No 73 

*********************** ******** **************** *** ****** 

D56A-AG1 0 TGGGAATTGCCAGATGGTATGAAACCAAGTGAGCTTAAAATGGATGATATTTTTGGACTC 

D 5 8 -AD1 2 TGGGAATTGCCAGATGGTATGAAACCAAGTGAGCTTAAAATGGATGATATTTTTGGACTC 

D58 -BC5 TGGGAATTGCCAGATGGTATGAAACCAAGTGAGCTTAAAATGGATGATATTTTTGGACTC 
************************************************************ 

I 

D5 8 -AD1 2 ACTGCTCCAAGAGCTAATCGACTCGTGGCTGTGCCTACTCCACGTTTGTTGTGTCCCCTT 

I I I 

D58 -BC5 ACTGCTCCAAGAGCTAATCGACTCGTGGCTGTGCCTAGTCCACGTTTGTTGTGCCCACTT 



FIGURE 151: COMPARISON OF SEQUENCE GROUPS 



D56A-AG10 TATTAA 
D58-AD12 TATTAA 
D58-BC5 TATTAA 

PERCENT IDENTITY OF GROUP 15 



D56-BC5 



D56A-AG10 *** 99.5 95.7 SEQ ID No 71 

D58-AD12 *** 96.2 SEQ ID No 75 

D58-BC5 *** SEQ ID No 7 3 

ALIGNMENT OF GROUP 16 

D56-AD6 ATGCTTTGGAGTGCGAGTATAGTGCGCGTCAGCTACCTAACTTGTATTTATAGATTCCAA SEQ ID No 87 

D56-AC11 ATGCTTTGGAGTGCGAGTATAGTGCGCGTCAGCTACCTAACTTGTATTTATAGATTCCAA SEQ ID No 77 

D35-39 ATGCT TT GGAGTGCGAGTATAGTGC GCGTCAGCTACCTAACTTGTATTTATAGATTCCAA SEQ ID No 79 

I 

D58-BH4 ATGCTTT GGAGTGCGAGTATAGTGC GCGTCAGCTACCTAACCTGTATTTATAGATTCCAA SEQ ID No 81 

***************************************** ****************** 

D^6-AC11 GTATATGCTGGGTCTGTGTTCAGAGTAGCATGAp35- 3 9 -( DelMad'i 

GTATATGCTGGGTCTGTGTTCAGAGTAGCATGA """ ^ 

D58 -BH4 GTATATGCTGGGTCTGTGTTCAGAGTAGCATGA 
******************* ************* 



PERCENT IDENTITY OF GROUP 16 



D56-AC11 D56-AD6 D58-BH4 D35-39 

D56-AC11 *** 98.7 98.7 98.7 SEQ ID No 77 

D56-AD6 *** 98.7 98.7 SEQ ID No 87 

D5B-BH4 *** 98.7 SEQ ID No 81 

D35-39 *** SEQ ID No 79 

ALIGNMENT OF GROUP 17 

D7 3A-AD6 CTGAATTTTGCAATGTTAGAGGCAAAAATGGCACTTGCATTGATTCTACAACACTATGCT SEQ ID No 89 

D70A-BA11 CTGAATTTTGCAATGTTAGAGGCAAAAATGGCACTTGCATTGATTCTACAACACTATGCT SEQ ID No 91 

********************************** ** **************** * ******* 

D7 3 A- AD 6 TTTGAGCTCTCTCCATCTTATGCACATGCTCCTCATACAATTATCACTCTGCAACCTCAA 

I 

D70A-BA1 1 TTTGAGCTCTCTCCATCTTATGCACACGCTCCTCATACAATTATCACTCTGCAACCTCAA 

D73A-AD6 C ATGGTGCTCC TTTGATTTTGCGCAAGCTGTAG 

D70A-BA1 1 CATGGTGCTCCTTTGATTTTGCGCAAGCTGTAG 



FIGURE 151: 

PERCENT IDENTITY OF GROUP 17 
D7 3A-AD 70A-BA11 



COMPARISON OF SEQUENCE GROUPS 



D73A-AD6 *** 99.3 
D70A-BA11 *** 
I 

ALIGNMENT OF GROUP 18 



SEQ ID No B9 
SEQ ID NO 91 



D70A-AB5 
D70A-AAB 



CAAAACTTCGCGATTTTGGAAGCAAAAATGGCTATAGCTATGATTCTACAACGCTTCTCC 
CAAAACTTCGCGATTTTGGAAGCAAAAATGGCTATAGCTATGATTCTACAACGCTTCTCC 



SEQ ID No 95 
SEQ ID No 97 



D70A-AB5 
D70A-AA8 



TTCGAGCTCTCCCCATCTTATACACACTCTCCATACACTGTGGTCACTTTGAAACCCAAA 
f 

T TCGAGCTCTCTCCATCTTAT ACAC ACTCTCCATAC AC TGTGGTCACTTTGAAACCCAAA 



Dl,0A-AB5 
D70A-AA8 



TATGGTGCTCCCCTAATAATGCACAGGCTGTAG^ 
TATGGTGCTCCCCTAATAATGCACAGGCTGTAG " 



PERCENT IDENTITY OF GROUP 18 



D70A-AB5 



D70A-AB5 *** 
D70A-AA8 



D70A-AA8 



99.6 
* + * 



SEQ ID No 95 
SEQ ID No 97 



AUGMENT CF (3QOP 19 

D70A-AB8 
D70A-BH2 



D70A-AA4 



CAAAATTTTGCCATGTTAGAAGCAAAGATGGCTCTGTCTATGATCCTGCAACGCTTCTCT 
II I I II I II I I 

ATAAACTTTGCAATGACAGAAGCGAAGATGGCTATGGCTATGATTCTGCAACGCTTCTCC 



ATAAACTTTGCAATGGCAGAAGCGAAGATGGCTATGGCTATGATTCTGCAACGCTTCTCC 
*** ***** *** ****** **** ** ** ******* ** *********** 



SEQ ID NO 99 
SEQ ID No 101 
SEQ ID NO 103 



D70A-AB8 TTTGAACTGTCTCCGTCTTATGCACATGCCCCTCAGTCCATATTAACCGT-CAGCCACAA 

II I I II I I I I I I I 

D70A-BH2 TTTGAGCTATCTCCATCTTACACACATGCTCCACAGTCTGTAATAACTATGCAACCCCAA 

D70A-AA4 TTTGAGCTATC TCCATCTTACACACAT GCTCC ACAGTCTGTAATAACT ATGCAACCCCAA 



D70A-AB8 

D-|0A-BH2 
D70A-AA4 



TATGGTGCTCCACTTATTTTCCACAAGCTATAA 

I II III 

TATGGTGCTCCTCTTATATTGCACAAATTGTAA^ 



******* ** 



Deleted: i 



PERCENT IDENTITY OF GROUP 19 

P7QA-AB8 D7QA-AA4 P7QA-PH? 



D7QA-AB8 *** 
D7 OA-AM 
D7QA-BH2 



77.8 



77.8 
99.3 



SEQ ID No 99 
SEQ ID No 101 
SEQ ID No 103 



AEJOyEOT CP GROT 2Q 



FIGURE 151: COMPARISON OF SEQUENCE GROUPS 

D70A-BA1 CAAAACTTTGCAATGATGGAAGCAAAAATGGCAGTAGCTATGATACTACAAAAATTTTCC SEQ ID No 105 

I 

D70A-BA9 CAAAACTTTGCAATGATGGAAGCAAAAATGGCAGTAGCTATGATACTACATAAATTTTCC SEQ ID No 107 

D70A-BA1 T TT GAACTATCCCCTTCTTATACACATGCTCCATTTGCAATTGTGACTATTCATCCTCAG 

D7 0 A-BA 9 TTTGAACTATCCCCTTCTTATACACATGCTCCATTTGCAATTGTGACTATTCATCCTCAG 

D70A-BA1 
D70A-BA9 

************************* 
PERCENT IDENTITY OF GROUP 20 

PIOA-BAl D70A-BA9 

D70A-BA1 *** 99.4 SEQ ID No 105 

D70A-BA9 *** SEQ ID No 107 

I 

AUaMNT CP ggXP 22 

D14 4-AH1 TATAGCTTGGGGCTCAAGGAGATTCAAGCTAGCTTAGCTAATCTTCTACATGGATTTAAC SEQ ID No 113 

I I 

D34-65 CATAGCTTGGGGCTCAAGGTGATTCAAGCTAGCTTAGCTAATCTTCTACATGGATTTAAC SEQ ID No 115 

I I I 

D181-AC5 TATAGCATGGGGCTCAAGGCGATTCAAGCTAGCTTAGCTAATCTTCTACATGGATTTAAC SEQ ID No 111 

Dl 4 4-AH1 TGGTCATTGCCTGATAATATGACTCCTGAGGACCTCAACATGGATGAGATTTTTGGGCTC 
D3 4 -6 5 TGGTCATTGCCTGATAATATGACTCCTGAGGACCTCAACATGGATGAGATTTTTGGGCTC 
D1B1-AC5 TGGTCATTGCCTGATAATATGACTCCTGAGGACCTCAACATGGATGAGATTTTTGGGCTC 

Dl 4 4 -AH1 TCTACACCTAAAAAATTTCCACTTGCTACTGTGATTGAGCCAAGACTTTCACCAAAACTT 

D34 -65 TCTACACCTAAAAAATTTCCACT TGCTACTGTGATTGAGCCAAGAC TT TCACCAAAACTT 

D16 1-AC5 TCTACACCTAAAAAATTTCCACTTGCTACTGTGATTGAGCCAAGACTT TCACCAAAACTT 

******* ************************* ***■**+**++ ****************** 

D14 4-AH1 TACTCTGTTTGA 

D34-65 TACTCTGTTTGA 

D181-AC5 

p|rcent IDENTITY OF GROUP 22 

D34-65 D181-AC5 D144-AH1 

D34-65 *** 98.4 99.0 seq id no lis 

D181-A3 *** 99.0 seq id No 111 

D144-JM1 *** seq ID No 113 

tfmzmw cp <ycp? 25 

-** ] Deleted: ]| 

D58-AA1 TTGGGCT TGGCAACGGTGCATGTGAAT TTGATGT TGGCCCGAATGATTCAAGAATT TGAA SEQ ID No 121 



FIGURE 151: COMPARISON OF SEQUENCE GROUPS 

D185-BC1 T T GGGCT TGGC AACG GT GC AT GTGAAT TTGAT GT TGGCCCGAACGA TTCAAG AATT TG AA SEQ ID No 133 

I 

D18 5-BG2 TTGGGCT TGGC AACGGTGCAT GTGAAT TTGAT GTTGGCCCGAATGATTCAAGAATTTGAA SEQ ID No 135 

D5B-AA1 TGGTCCGCTTACCCGGAAAATAGGAAAGTGGATTTTACTGAGAAATTGGAATTTACTGTG 

D185-BC1 TGGTCCGCTTACCCGGAAAATAGGAAAGTGGATTTTACTGAGAAATTGGAATTTACTGTG 

I 

D185-BG2 TGGTCCGCTTACCCGGAAAAT AGGAAAGTGGATTT- ACTGAGAAATTGGAATTTACTGTG 

D58-AA1 G TG AT G AAAAA TC CT TT AAGAGC TAAG GTCAAGC CAAGAATGC AAGT G GTGT AA 

I 

D185-BC1 GT G AT GAAAAACCCT T T AAGAGC T AAGGTC AAGCC AAG AATGCAAG TG GTGT AA 

i I t I I f I I I I I M I I I I I I I I I I I I I I I I I I I | | | I I I I I M I 

D185-BG2 GTGA 

PERCENT IDENTITY OF GROUP 25 

D56-AA1 D185-BG2 D1B5-BC1 

D58-AA1 *** 95.9 98.9 SEQ ID No 121 

D185-BG2 *** 95.1 SEQ ID No 135 

D185-BC1 *** SEQ ID No 133 

aLeQMEUT CP gjgg 28 

D17 7-BF7 ATCACATTTGCTAAGTTTGTGAATGAGCTAGCATTGGCAAGATTAATGTTCCATTTTGAT SEQ ID No 127 

D185-BD2 ATCACATTTGCTAAGTTTGTGAATGAGCTAGCATTGGCAAGATTAATGTTCCATTTTGAT SEQ ID No 139 

D185-BE1 ATCACATTTGCTAAGTTTGTGAATGAGCTAGCATTGGCAAGATTAATGTTCCATTTTGAT SEQ ID No 137 

********************************* *************************** 

Dl 7 7-BF7 TTCTCGCTACCAAAAGGAGTTAAGCATGAGGATTTGGACGTGGAGGAAGCTGCTGGAATT 

I 

Dl 8 5-BD2 TTCTCGCTACCAAAAGGAGTTAAGCATGCGGATTTGGACGTGGAGGAAGCTGCTGGAATT 

I 

Dl 8 5-BE 1 TTCTCGCTACCAAAAGGAGTTAAGCATGAGGATTTGGACGTGGAGGAAGCTGCTGGAATT 
**************************** ******************************* 

Dl 7 7 -BF7 ACTGTTAGAAGGAAGTTCCCCCTTTTAGCCGTCGCCACTCCATGCTCGTGA 

Dl 8 5-BD2 ACTGTTAGAAGGAAGTTCCCCCTTTTAGCCGTCGCCACTCCATGCTCGTGA 
I 

Dl 8 5-BE1 ACTGTTAGGAGGAAGTTCCCCCTTTTAGCCGTCGCCACTCCATGCTCGTGA 
******** ****************************************** 

PERCENT IDENTITY OF GROUP 28 



P177-BT7 D185-BD2 D185-HR1 

D177-BF7 *** 99.4 99.4 seq id no 127 

D185-ED2 *** 98.8 seq id no 139 

D185-BEL *** SEQ ID No 137 



AUQMMT CP CBOGP 30 



FIGURE 151: COMPARISON OP SEQUENCE GROUPS 



D70A-AA12 ATGTCATTTGGTTTAGCTAATCTTTACTTACCATTGGCTCAATTACTCTATCACTTTGAC SEQ ID No 131 

* I I I I I I fit I I 

D17 6-BF2 ATATCATTTGGTTTGGCTAATGT TTAT TTGCCACTAGCTCAATTGTTATATCATTTTGAT SEQ ID No 85 

** *********** ****** **** ** *** * ******** * ***** ***** 

D7 OA-AA 1 2 TGGAAACTCCCAACCGGAATCAAGCCAAGAGACTTGGACTTGACCGAATTATCGGGAATA 

II M I I I I I I I I 

Dl 7 6- BF2 TGGAAACTCCCTACTGGAATCAATTCAAGTGACTTGGACATGACTGAGTCGTCAGGAGTA 
*********** ** ******** **** ********* ***, ** * «* *»* ** 

D70A-AA12 ACTATTGCTAGAAAGGGTGACCTTTACTTAAATGCTACTCCTTATCAACCTTCTCGAGAGTAA 

I I I III II I III 

D17 6-BF2 ACT TGTGCTAGAAAGAGTGATTTATACTTGACTGCTACTCCATATCAACTTTCTCAAGAGTGA 



P|RCENT IDENTITY OF GROUP 30 

D176-BF2 D7QA-AA12 



D176-BF2 
D7 OA-AA 12 



77.0 
*** 



SEQ ID No 85 
SEQ ID No 131 
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Figure 153: Cloning of Cytochrome P450 cDNA Fragments by PCR 
| GRRXCP(A/G) 
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I = Deoxyllnosine; Y = C, T; M = A,C; R = A,G; S = C,G; N= A,T,C,G 



